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BLASTP 2.2.7 [Jan-02-2004] 



Reference : 

Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

RID: 1073935785-25692-185772007425. BLASTQ4 
Query= 

(398 letters) 



Database: All non-redundant GenBank CDS 
trans lations+PDB+SwissProt+PIR+PRF 

1,591,231 sequences; 521,258,069 total letters 

If you have any problems or questions with the results of this search 
please refer to the BLAST FAQs 

Taxonomy reports 



Distribution of 50 Blast Hits on the Query Sequence 



Mouse-over to show defline and scores. Click to show alignments^ 
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Score 



Sequences producing significant alignments: 



qi|168764351ref I NP 473362.11 
qi I 21929156 I dbi I BAC06152 . 1 1 

qil38086312|ref |XP 141764.31 

qi I 34881783 I ref I XP 229186.2 | 

qi|24476016|ref |NP 722561.11 

qi I 6677701 1 ref |NP 031395.11 
qi 129611554 I qb I AAO85088.il 



G protein-coupled receptor 101. 
seven transmembrane helix recep. 
similar to G protein-coupled r. 
similar to G protein-coupled r. 
G protein-coupled receptor 161. 
G protein-coupled receptor 161;. 
G protein-coupled receptor GPR10. 
qil 8843925 Iqb I AAF80168. 1 1 alpha la-adrenoceptor isoform 2 [. 
qi 1 3023219 I sp I OQ2824 I A1AA RABIT Alpha-IA adrenergic recepto. 
qi I 26788019 1 embl CAC94897 . 2 | SI:bZ20I5.4 (novel protein simi. 
qi I 8843927 1 qb 1 AAF80169 . 1 1 alpha la-adrenoceptor isoform 3 [. 
qi 1 15004694 I qb I AAK77197 . 1 1 adrenergic receptor alpha-la [Ho. 
qi!154517591ref 1NP 150646.11 alpha-lA-adrenergic receptor i. 
qi 1 15451761 1 ref I NP 150647 . 1 1 alpha-lA-adrenergic receptor i. 
qi 14501961 1 ref 1NP 000671.11 alpha-lA-adrenergic receptor is. 
qi I 666893 I qb I AAB59486 . 1 1 alpha-lC-adrenergic receptor 
qi 1 15451757 | ref 1NP 150645.1 1 alpha-lA-adrenergic receptor i. 
qi|1168246|sp|P35348IAlAA HUMAN Alpha-IA adrenergic recepto, 
qi|1168247|sp|P43140IAlAA RAT Alpha-IA adrenergic receptor , 
qil 31542114 I ref I NP 0384 89 . 2 I adrenergic receptor, alpha la;, 
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qi|20141255|splP977181AlAA MOUSE Alpha-IA adrenergic recept . 

qi 1 26351717 1 dbj I BAC394 95. 1 [ unnamed protein product [Mus mu. 
qi|2494939lsp|Q91175|AlAA ORYLA Alpha-IA adrenergic recepto. 
qi I 7441613 Ipir [ IS71323 alpha-lA adrenergic receptor - Japan. 

qi I 8392870 I ref I NP 058887.11 adrenergic receptor, alpha la; . 

qi | 409029 I qbl AAA93114 . 1 1 alphalC adrenergic receptor 

qi I 547222 I qb| AAB31165 . 1 1 alpha adrenergic receptor subtype . 

qi 1 14 9165191 splQ9WU25 | A1AA_CAVP0 Alpha-IA adrenergic recept. 

qi 1278062131 ref I NP_ 77 6923 . 1 1 adrenergic, alpha 1A, receptor. 
qi I 656338 6 I emb 1 CAB6257 0 . 1 | alpha-lA adrenergic receptor [Su. 

qi 1 4501957 | ref 1 NP 000669.11 alpha-lD-adrenergic receptor; a. 

qi I 7 690135 I qb 1 AAB31 1 63 . 2 | alpha adrenergic receptor subtype. 

qi 1 86790 I pir | IJH0447 alpha-lA-adrenergic receptor - human >. 
qi I 4 03627 63 I qb | AAR84 650 . 1 1 alpha 1A adrenoceptor isoform 6 . 

qi 1 40362753 1 qb I AAR84 645.1 1 alpha 1A adrenoceptor isoform 2c. 
qi | 63957 3 I qb I AAB30835 . 1 1 alpha lc-adrenoceptor, alpha lc-AR. 

qi 1 4 03627 57 1 qb | AAR84 647 . 1 | alpha 1A adrenoceptor isoform 3c. 
qi | 40362751 1 qbl AAR8 4 64 4 . 1 1 alpha 1A adrenoceptor isoform 2b. 
qi 1 403627 59 I qbl AAR84 64 8 . 1 | alpha 1A adrenoceptor isoform 5a. 

qi | 478273 Ipir I | JC1525 alpha-lB-adrenergic receptor - rat >g. 

qi 1 8392867 | ref |NP 058687.11 adrenergic receptor, alpha lb; . 
qil5902693|sp|O77621|AlAA CAN FA Alpha-IA adrenergic recepto. 

qi 1 11682451 spl P35368 I A1AB HUMAN Alpha-IB adrenergic recepto. 
qi 1112871 lsp| P18841 IA1AB MESAU Alpha-IB adrenergic receptor. 

qi|45019591ref |NP 000670.11 alpha-lB-adrenergic receptor; a. 
qi I 547221 | qb I AAB31 164 . 1 1 alpha adrenergic receptor subtype . 
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Alignments 



> qi| 16876435 I ref |NP 473362.11 D G protein-coupled receptor 101 [Homo sapiens] 
qi 1 16566341 1 qb 1 AAL26486 . 1 1 D G protein-coupled receptor [Homo sapiens] 
qi I 20152240 1 dbj I BAB89301 . 1 1 putative G-protein coupled receptor [Homo sapiens] 
Length = 508 

Score = 495 bits (1274), Expect = e-139 

Identities = 242/242 (100%), Positives = 242/242 (100%) 

Query: 1 MTSTCTNSTRESNSSHTCMPLSKMPISLAHGIIRSTVLVIFLAAS FVGNIVLALVLQRKP 60 

MTSTCTNSTRESNSSHTCMPLSKMPISLAHGIIRSTVLVIFLAASFVGNIVLALVLQRKP 
Sbjct: 1 MTSTCTNSTRESNSSHTCMPLSKMPISLAHGIIRSTVLVIFLAAS FVGNIVLALVLQRKP 60 

Query: 61 QLLQVTNRFIFNLLVTDLLQISLVAPWWATSVPLFWPLNSHFCTALVSLTHLFAFASVN 120 

QLLQVTNRFIFNLLVTDLLQISLVAPWWATSVPLFWPLNSHFCTALVSLTHLFAFASVN 
Sbjct: 61 QLLQVTNRFIFNLLVTDLLQISLVAPWWATSVPLFWPLNSHFCTALVSLTHLFAFASVN 120 

Query: 121 TIVWSVDRYLSIIHPLSYPSKMTQRRGYLLLYGTWIVAILQSTPPLYGWGQAAFDERNA 180 

TIVWSVDRYLSIIHPLSYPSKMTQRRGYLLLYGTWIVAILQSTPPLYGWGQAAFDERNA 
Sbjct: 121 TIVWSVDRYLSIIHPLSYPSKMTQRRGYLLLYGTWIVAILQSTPPLYGWGQT^AFDERNA 180 

Query: 181 LCSMIWGASPSYTILSWSFIVIPLIVMIACYSWFCAARRQHALLYNVKRHSLEVRVKD 240 

LCSMIWGASPSYTILSWSFIVIPLIVMIACYSWFCAARRQHALLYNVKRHSLEVRVKD 
Sbjct: 181 LCSMIWGAS PS YTILS WS FI VI PLI VMI ACYS WFCAARRQHALLYNVKRHSLEVRVKD 240 
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Query: 241 CV 242 
CV 

Sbjct: 241 CV 242 



Score = 231 bits (589), Expect = 2e-59 

Identities = 109/139 (78%), Positives = 110/139 (79%) 

Query: 260 MNIPEXXXXXXXXXXXXXXXXXCYQCKAAKVIFIIIFSYVLSLGPYCFLAVLAVWVDVET 319 

+NIPE CYQCKAAKVIFIIIFSYVLSLGPYCFLAVLAVWVDVET 
Sbjct: 370 VNIPESLPPSRRNSNSNPPLPRCYQCKAAKVIFIIIFSYVLSLGPYCFLAVLAVWVDVET 429 

Query: 320 QVPQWVITIIIWLFFLQCCIHPYVYGYMHKTIKKEIQDMLXXXXXXXXXXXXDSHPDLPG 379 

QVPQWVITIIIWLFFLQCCIHPYVYGYMHKTIKKEIQDML DSHPDLPG 
Sbjct: 430 QVPQWVITIIIWLFFLQCCIHPYVYGYMHKTIKKEIQDMLKKFFCKEKPPKEDSHPDLPG 489 

Query: 380 TEGGTEGKIVPSYDSATFP 398 

TEGGTEGKIVPSYDSATFP 
Sbjct: 4 90 TEGGTEGKIVPSYDSATFP 508 



> qi I 21929156 I dbj 1 BAC06152 . 1 1 seven transmembrane helix receptor [Homo sapiens] 
Length = 485 

Score - 447 bits (1150), Expect = e-124 

Identities = 219/219 (100%), Positives = 219/219 (100%) 

Query: 24 MPISLAHGIIRSTVLVIFLAASFVGNIVLALVLQRKPQLLQVTNRFIFNLLVTDLLQISL 83 

MPISLAHGIIRSTVLVIFLAASFVGNIVLALVLQRKPQLLQVTNRFIFNLLVTDLLQISL 
Sbjct: 1 MPI SLAHGI I RST VLVI FLAAS FVGNI VLALVLQRKPQLLQVTNRFI FNLLVTDLLQI SL 60 

Query 84 VAPWWATSVPLFWPLNSHFCTALVSLTHLFAFASVNTIVWSVDRYLSIIHPLSYPSKM 143 

VAPWWATSVPLFWPLNSHFCTALVSLTHLFAFASVNTIVWSVDRYLSIIHPLSYPSKM 
Sbjct: 61 VAPWWATSVPLFWPLNSHFCTALVSLTHLFAFASVNTIVWSVDRYLSIIHPLSYPSKM 120 

Query: 144 TQRRGYLLLYGTWIVAILQSTPPLYGWGQAAFDERNALCSMIWGASPSYTILSWSFIVI 203 

TQRRG YLLL YGTW I VAI LQST P PL YGWGQAAFDERNALCSM I WGAS PSYTI LS WS FI VI 
Sbjct: 121 TQRRGYLLLYGTWIVAILQSTPPLYGWGQAAFDERNALCSMIWGASPSYTILSWSFIVI 180 

Query: 204 PLIVMIACYSWFCAARRQHALLYNVKRHSLEVRVKDCV 242 

PLIVMIACYSWFCAARRQHALLYNVKRHSLEVRVKDCV 
Sbjct: 181 PL I VMI AC YS WFCAARRQHALLYNVKRHS LEVRVKDCV 219 



Score = 231 bits (590), Expect = le-59 

Identities = 109/139 (78%), Positives = 110/139 (79%) 

Query: 260 MNIPEXXXXXXXXXXXXXXXXXCYQCKAAKVIFIIIFSYVLSLGPYCFLAVLAVWVDVET 319 

+NIPE CYQCKAAKVI FI II FS YVLSLGPYCFLAVLAVWVDVET 

Sbjct: 347 VNIPESLPPSRRNSNSNPPLPRCYQCKAAKVIFI I I FS YVLSLGPYCFLAVLAVWVDVET 406 

Query: 320 QVPQWVITIIIWLFFLQCCIHPYVYGYMHKTIKKEIQDMLXXXXXXXXXXXXDSHPDLPG 37 9 

QVPQWVITIIIWLFFLQCCIHPYVYGYMHKTIKKEIQDML DSHPDLPG 
Sbjct: 407 QVPQWVITIIIWLFFLQCCIHPYVYGYMHKTIKKEIQDMLKKFFCKEKPPKEDSHPDLPG 466 

Query: 380 TEGGTEGKIVPSYDSATFP 398 
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TEGGTEGKIVPSYDSATFP 
Sbjct: 467 TEGGTEGKIVPSYDSATFP 485 



> qi | 38086312 | re f I XP 1417 64 . 3 I O similar to G protein-coupled receptor 101 [Mus mus 
Length = 615 

Score = 404 bits (1037), Expect = e-111 

Identities - 189/242 (78%), Positives - 212/242 (87%) 

Query: 1 MTSTCTNSTRESNSSHTCMPLSKMPISLAHGIIRSTVLVIFLAASFVGNIVLALVLQRKP 60 

M +CTNST+E+N S C+PLSKMPIS+AHGIIRS VL+ + L +F+GN+VL VL RKP 
Sbjct: 105 MPPSCTNSTQENNGSRVCLPLSKMPISVAHGIIRSVVLLVILGVAFLGNVVLGYVLHRKP 164 

Query: 61 QLLQVTNRFIFNLLVTDLLQISLVAPWVVATSVPLFWPLNSHFCTALVSLTHLFAFASVN 120 

LLQVTNRFIFNLLVTDLLQ++LVAPWW+T++P FWPLN HFCTALVSLTHLFAFASVN 
Sbjct: 165 NLLQVTNRFIFNLLVTDLLQVALVAPWVVSTAIPFFWPLN I HFCTALVSLTHLFAFASVN 224 

Query: 121 TIVVVSVDRYLSIIHPLSYPSKMTQRRGYLLLYGTWIVAILQSTPPLYGWGQAAFDERNA 180 

TIVVVSVDRYL+IIHPLSYPSKMT RR Y+LLYGTWI A LQSTPPLYGWG A FD+RNA 
Sbjct: 225 TIVWSVDRYLTIIHPLSYPSKMTNRRSYILLYGTWIAAFLQSTPPLYGWGHATFDDRNA 284 

Query: 181 LCSMIWGAS PS YT I LS WS FI VI PL I VMI AC YS VVFCAARRQHALLYNVKRHSLEVRVKD 240 

CSMIWGASP+YT++SVVSF+VIPL VMIACYSWF AARRQ ALLY K H LEVRV+D 
Sbjct: 285 FCSMIWGASPAYTVVSVVSFLVIPLGVMIACYSVVFGAARRQQALLYKAKSHRLEVRVED 344 

Query: 241 CV 242 
V 

Sbjct: 345 SV 346 



Score « 195 bits (496), Expect = le-48 

Identities = 93/143 (65%), Positives = 104/143 (72%), Gaps = 4/143 (2%) 

Query: 260 MNIPEXXXXXXXXXXXXXXXXXCYQCKAAKVIFIIIFSYVLSLGPYCFLAVLAVWVDVET 319 

M IPE CY+CKAA+VIF+II +YVLSLGPYCFLAVLAVWVD++T 

Sbjct: 473 MRIPESSPPSRRNSTSDPPLPPCYECKAARVIFVIISTYVLSLGPYCFLAVLAVWVDIDT 532 

Query: 320 QVPQWVI T 1 1 1 WLFFLQCC I HP YVYGYMHKT I KKEIQDMLXXXXXXXXXXXXDSHPDL — 377 

+VPQWVITI I IWLFFLQCCIHPYVYGYMHK+IKKEIQ++L DSHPDL 
Sbjct: 533 RVPQWVITIIIWLFFLQCCIHPYVYGYMHKSIKKEIQEVLKKLICKKSPPVEDSHPDLHE 592 

Query: 378 — PGTEGGTEGKIVPSYDSATFP 398 

GTEGG EGK VPS+DSAT P 
Sbjct: 593 TEAGTEGGIEGKAVPSHDSATSP 615 



> qi | 348817 83 | ref |XP 229186,2 | D similar to G protein-coupled receptor 101 [Rattus 
Length = 508 

Score = 404 bits (1037), Expect = e-111 

Identities = 191/242 (78%), Positives = 212/242 (87%) 

Query: 1 MTSTCTNSTRESNSSHTCMPLSKMPISLAHGIIRSTVLVIFLAASFVGNIVLALVLQRKP 60 

M S+CTNST+E+NSS C+PLSKMPIS+AHGIIRS VL+I L +FVGN+VL VL RKP 
Sbjct: 1 MPSSCTNSTQENNSSRVCLPLSKMPISIAHGIIRSWLLIILGVAFVGNWLGYVLHRKP 60 
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Query: 61 QLLQVTNRFIFNLLVTDLLQISLVAPWVVATSVPLFWPLNSHFCTALVSLTHLFAFASVN 120 

LLQVTNRFIFNLLVTDLLQ++LVAPWVV+T++P FWPLN HFCTALVSLTHLFAFASVN 
Sbjct: 61 HLLQVTNRF I FNLLVTDLLQVALVAPWVVSTAIPFFWPLN I HFCTALVSLTHLFAFASVN 120 

Query: 121 TIVWSVDRYLSIIHPLSYPSKMTQRRGYLLLYGTWIVAILQSTPPLYGWGQAAFDERNA 180 

TIVWS+DRYLSIIHPLSYPSKMT RR Y+LLYGTWI A LQSTPPLYGWG A FD+RNA 
Sbjct: 121 TIVVVSIDRYLSIIHPLSYPSKMTNRRSYILLYGTWIAAFLQSTPPLYGWGHATFDDRNA 180 

Query: 181 LCSMIWGASPSYTILSVVSFIVIPLIVMIACYSVVFCAARRQHALLYNVKRHSLEVRVKD 240 

CSMIWG SP+YT++SVVSF+VIPL VMIACYSVVF AARRQ ALLY K H +VRVKD 
Sbjct: 181 FCSMIWGDSPAYTVVSVVSFLVIPLGVMIACYSVVFGAARRQQALLYKAKSHRFQVRVKD 240 

Query: 241 CV 242 
V 

Sbjct: 241 SV 242 



Score = 197 bits (501), Expect = 3e-49 

Identities = 93/143 (65%), Positives = 106/143 (74%), Gaps = 4/143 (2%) 

Query: 260 MNIPEXXXXXXXXXXXXXXXXXCYQCKAAKVIFIIIFSYVLSLGPYCFLAVLAVWVDVET 319 

M IPE CY+CKAA+VIFIIIFSYVLSLGPYCFLAVLAVWVD+++ 
Sbjct: 366 MRIPESRPPSRRNSTSNPPLPPCYECKAARVIFIIIFSYVLSLGPYCFLAVLAVWVDIDS 425 

Query: 320 QVPQWVITIIIWLFFLQCCIHPYVYGYMHKTIKKEIQDMLXXXXXXXXXXXXDSHPDL— 377 

QVPQWVITI I IWLFFLQCC+HPYVYGYMHK+ IKKEI+++L DSHP+L 
Sbjct: 426 QVPQWVITI I IWLFFLQCCVHPYVYGYMHKSIKKEIKEVLKKLTCKKSTSVDDSHPELRE 485 

Query: 378 — PGTEGGTEGKIVPSYDSATFP 398 

GTEGGTEGK +PS+DSAT P 
Sbjct: 486 TEAGTEGGTEGKAI PSHDSATSP 508 



> qi|24476016|ref jNP 722561.11 D G protein-coupled receptor 161; G-protein coupled 
sapiens] 

qi I 20381353 jqb 1AAH28163.1I D G protein-coupled receptor 161 [Homo sapiens] 
qi I 30526190 I qb| AAP32300 . 1 1 D G-protein coupled receptor RE 2 [Homo sapiens] 
Length * 529 

Score « 137 bits (346), Expect = 3e-31 

Identities = 87/323 (26%), Positives = 151/323 (46%), Gaps - 21/323 (6%) 

Query: 37 VLVIFLAASFVGNIVLALVLQRKPQLLQVTNRFIFNLLVTDLLQISLVAPWWATSVPLF 96 

V+ IF+ +GN+V+ + L +K LL ++N+F+F+L +++ L LV P+W +S+ 
Sbjct: 36 VITIFVC LGNLVIWTLYKKSYLLTLSNKFVFSLTLSNFLLSVLVLPFWTSSIRRE 92 

Query: 97 WPLNSHFCTALVSLTHLFAFASVNTIVWSVDRYLSIIHPLSYPSKMTQRRGYLLLYGTW 156 

W +C L L + AS+ T+ V+++DRY ++++P+ YP K+T R + L W 

Sbjct: 93 WI FGWWCNFSALLYLLISSASMLTLGVIAI DRYYAVLYPMVYPMKITGNRAVMALVYIW 152 

Query: 157 I VAILQSTPPLYGWGQAAFDERNALCSMIWGASPSYTILSWSFIVIPLI VMIACYSVVF 216 

+ +++ PPL+GW FDE +C W P YT + + P +VM+ CY +F 
Sbjct: 153 LHSLIGCLPPLFGWSSVEFDEFKWMCVAAWHREPGYTAFWQIWCALFPFLVMLVCYGFIF 212 

Query: 217 CAARRQHALLYNVKRHSLEVRVKDCVXXXXXXXXXXXXXXXXXMNIPEXXXXXXXXXXXX 276 
AR V+ + + 
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Sbjct: 213 RVAR VKARKVHCGTVVIVEEDAQRTGRKNSSTSTSSSGSRRNAFQ 257 

Query: 277 XXXXXCYQCKAAKVIFI I I FSYVLSLGPY-CFLAVLAVWVDVETQVPQWVITI I IWLFFL 335 

QCKA I +++ +++++ GPY +A A+W ++ V + T WL F 
Sbjct: 258 G VVYS ANQCKAL I T I L WLGAFMVTWG P YM VVI AS EALW — GKS S VS PSLET WATWLS FA 315 

Query: 336 QCCIHPYVYGYMHKTIKKEIQDM 358 

HP +YG +KT++KE+ M 
Sbjct: 316 SAVCHPLI YGLWNKTVRKELLGM 338 



> qi | 667 7701 I ref I NP 031395 . 1 1 D G protein-coupled receptor 161; G-protein coupled r 
sapiens] 

qi I 3659903 I qb I AAC61598 . 1 I D G-protein coupled receptor RE2 [Homo sapiens] 
Length = 407 

Score = 134 bits (336), Expect = 4e-30 

Identities = 88/330 (26%), Positives = 153/330 (46%), Gaps = 20/330 (6%) 

Query: 32 IIRSTVLVIFLAASFV — GNIVLALVLQRKPQLLQVTNRFIFNLLVTDLLQISLVAPWVV 89 

+1 + + I + FV GN+V+ + L +K LL ++N+F+F+L +++ L LV P+VV 
Sbjct: 26 VIITQFIAIIVITIFVCLGNLVIWTLYKKSYLLTLSNKFVFSLTLSNFLLSVLVLPFW 85 

Query* 90 ATSVPLFWPLNSHFCTALVSLTHLFAFASVNTIVWSVDRYLSIIHPLSYPSKMTQRRGY 14 9 

+S + W +C L L + AS+ T+ V+++DRY ++++P+ YP K+T R 

Sbjct: 86 TSSIRREWIFGWWCNFSALLYLLISSASMLTLGVIAIDRYYAVLYPMVYPMKITGNRAV 145 

Query 150 LLLYGTWIVAILQSTPPLYGWGQAAFDERNALCSMIWGASPSYTILSWSFIVIPLIVMI 209 

+ L W + +++ PPL+GW FDE +C W P YT + + P +VM+ 
Sbjct: 146 MALVYIWLHSLIGCLPPLFGWSSVEFDEFKWMCVAAWHREPGYTAFWQIWCALFPFLVML 205 

Query: 210 ACYSWFCAARRQHALLYNVKRHSLEVRVKDCVXXXXXXXXXXXXXXXXXMNIPEXXXXX 269 

CY +F AR V+ + + 

Sbjct: 206 VCYGFIFRVAR VKARKVHCGTWIVEEDAQRTGRKNSSTSTSSSG 250 

Query: 270 XXXXXXXXXXXXCYQCKAAKVIFIIIFSYVLSLGPY-CFLAVLAVWVDVETQVPQWVITI 328 

QCKA I +++ +++++ GPY +A A+W ++ V + T 
Sbjct: 251 SRRNAFQGWYSANQCKALITILWLGAFMVTWGPYMWIASEALW — GKSSVSPSLETW 308 

Query: 329 I I WLFFLQCC I HPYVYG YMHKT I KKE IQDM 358 

WL F HP +YG +KT++KE+ M 

Sbjct: 309 ATWLSFASAVCHPLI YGLWNKTVRKELLGM 338 



> qi I 29611554 I qb I AAO85088.il G protein-coupled receptor GPR101 [Mus musculus] 
Length « 77 

Score = 129 bits (324), Expect = 9e-29 
Identities - 62/76 (81%), Positives = 68/76 (89%) 

Query: 141 SKMTQRRGYLLLYGTWIVAILQSTPPLYGWGQAAFDERNALCSMIWGASPSYTILSWSF 200 

SKMT RR Y+LLYGTWI A LQSTPPLYGWG A FD+RNA CSMIWGASP+YT++SWSF 
Sbjct: 1 SKMTNRRSYILLYGTWIAAFLQSTPPLYGWGHATFDDRNAFCSMIWGASPAYTVVSWSF 60 

Query: 201 IVIPLIVMIACYSWF 216 

+VIPL VMIACYSWF 
Sbjct: 61 LVIPLGVMIACYSWF 76 
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> qi I 884 3925 | qb I AAF801 68 . 1 1 alpha la-adrenoceptor isoform 2 [Oryctolagus cuniculus] 
Length = 429 

Score = 124 bits (311), Expect = 3e-27 

Identities = 90/353 (25%), Positives = 160/353 (45%), Gaps = 18/353 (5%) 

Query: 7 NSTRESNSSHTCMPLSKMPISLAHGIIRSTVLVIFLAASFVGNIVLALVLQRKPQLLQVT 66 

N + + SN + H P+ + + + 1+ + L + +GNI++ L + L VT 

Sbjct: 7 NASDSSNCTH PPAPVNISKAILLGVILGGLILFGVLGNILVILSVACHRHLHSVT 61 

Query: 67 NRFIFNLLVTDLLQISLVAPWVVATSVPLFWPLNSHFCTALVSLTHLFAFASVNTIVVVS 12 6 

+ +1 NL V DLL S V P+ + +W FC ++ L AS+ + + V+S 

Sbjct: 62 HYYIVNLAVADLLLTSTVLPFSAIFEILGYWAFGRVFCNIWAAVDVLCCTASIISLCVIS 121 

Query: 127 VDRYLSIIHPLSYPSKMTQRRGYLLLYGTWIVAILQSTPPLYGWGQAAFDERNALCSMIW 186 

+DRY+ + + PL YP+ +TQRRG L W +++ S PL+GW Q A D+ +C + 
Sbjct: 122 IDRYIGVSYPLRYPTIVTQRRGLRALLCVWAFSLVISVGPLFGWRQPAPDDE-TICQI— 178 

Query: 187 GASPSYTILSVVSFIVIPLIVMIACYSVVFCAARRQHALLYNVKRHSLEVRVKDCVXXXX 246 

P Y + S + + PL +++A Y V+ A+R+ L + L+ D 
Sbjct: 179 NEEPGYVLFSALGSFYVPLT I ILAMYCRVYWAKRESRGL KSGLKTDKSDSEQVTL 234 

Query: 247 XXXXXXXXXXXXXMNIPEXXXXXXXXXXXXXXXXXCYQCKAAKVIFIIIFSYVLSLGPYC 306 

+ + + KAAK + I++ +VL P+ 

Sbjct: 235 RIHRKNAPAGGSGVASAKNKTHFSVRLLKFSR EKKAAKTLGIWGCFVLCWLPFF 289 

Query: 307 FLAVLAVWVDVETQVPQWVIT I I I WLFFLQCCI HPYVYGYMHKT I KKE IQDML 359 

+ + + + + P+ v 1+ WL +L CI+P +Y + KK Q++L 
Sbjct: 290 LVMPIGSFFP-DFKPPETVFKIVFWLGYLNSCINPIIYPCSSQEFKKAFQNVL 341 



> qi|3023219|sp 1 002824 I A1AA RABIT Alpha-IA adrenergic receptor (Alpha lA-adrenocept 
adrenergic receptor) 
qi I 2198745 I qb I AAB61334 . 1 1 alpha la-adrenoceptor [Oryctolagus cuniculus] 
Length = 466 



Score = 124 bits (311), Expect = 3e-27 

Identities = 90/353 (25%), Positives = 160/353 (45%), Gaps = 18/353 (5%) 



Query: 


7 


NSTRESNSSHTCMPLSKMPISLAHGIIRSTVLVIFLAAS FVGNIVLALVLQRKPQLLQVT. 


66 




N++ SN +H P++++ 1+ +L + +GNI++ L + L VT 




Sbjct: 


7 


NASDSSNCTH PPAPVN I SKAI LLGVI LGGL I LFGVLGN I LVILS VACHRHLHS VT 


61 


Query: 


67 


NRFIFNLLVTDLLQISLVAPWWATSVPLFWPLNSHFCTALVSLTHLFAFASVNTIVWS 


126 




+ +1 NL V DLL S V P+ + +W FC ++ L AS+ ++ V+S 




Sbjct: 


62 


HYYIVNLAVADLLLTSTVLPFSAIFEILGYWAFGRVFCNIWAAVDVLCCTASIISLCVIS 


121 


Query: 


127 


VDRYLSIIHPLSYPSKMTQRRGYLLLYGTWIVAILQSTPPLYGWGQAAFDERNALCSMIW 


186 




+DRY+ + +PL YP+ +TQRRG L W +++ S PL+GW Q A D+ +C + 




Sbjct: 


122 


IDRYIGVSYPLRYPTIVTQRRGLRALLCVWAFSLVISVGPLFGWRQPAPDDE-TICQI — 


178 


Query: 


187 


GASPSYTILSWSFIVIPLIVMIACYSWFCAARRQHALLYNVKRHSLEVRVKDCVXXXX 


246 




P Y + S + +PL +++A Y V+ A+R+ L + L+ D 




Sbjct: 


179 


NEEPGYVLFSALGSFYVPLTI ILAMYCRVYWAKRESRGL KSGLKTDKSDSEQVTL 


234 


Query: 


247 


XXXXXXXXXXXXXMNI PEXXXXXXXXXXXXXXXXXCYQCKAAKVI FI I I FS YVLSLGPYC 


306 
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+ + + KAAK + I + + +VL P+ 

Sbjct: 235 RIHRKNAPAGGSGVASAKNKTHFSVRLLKFSR EKKAAKTLGIVVGCFVLCWLPFF 289 

Query: 307 FLAVLAVWVDVETQVPQWVIT I I I WLFFLQCC I HPYVYGYMHKT I KKE IQDML 359 

+ + + + + P+ V 1+ WL +L CI+P +Y + KK Q++L 
Sbjct: 290 LVMPIGSFFP-DFKPPETVFKIVFWLGYLNSCINPIIYPCSSQEFKKAFQNVL 341 



> qi I 2 6788019 | emb I CAC94 897 . 2 | SI:bZ20I5.4 (novel protein similar to human G-protein 
receptor RE2) [Danio rerio] 
Length = 526 

Score = 124 bits (311), Expect - 3e-27 

Identities - 86/312 (27%), Positives = 147/312 (47%), Gaps = 18/312 (5%) 

Query: 47 VGNIVLALVLQRKPQLLQVTNRFIFNLLVTDLLQISLVAPWVVATSVPLFWPLNSHFCTA 106 

+GN+V+ + L +KP LL +N+F+F+L ++LL L+ P+VVA+SV W +C 
Sbjct: 39 LGNLVIWTLYKKPYLLTPSNKFVFSLTSSNLLLSVLMLPFWASSVRRDWMFGVVWCNF 98 

Query: 107 LVSLTHLFAFASVNTIVVVSVDRYLSIIHPLSYPSKMTQRRGYLLLYGTWIVAILQSTPP 166 

L L + +S+ T+ +++DRY ++++P+ YP K+T R L + W+ +++ PP 
Sbjct: 99 TALLHLLVSSSSMLTLGAIAIDRYYAVLYPMIYPMKITGNRAVLAIVYIWLHSLVGCLPP 158 

Query: 167 LYGWGQAAFDERNALCSMIWGASPSYTILSVVSFIVIPLIVMIACYSVVFCAARRQHALL 226 

L+GW FD C++ W SYT V ++PL+ M+ CY V+F AR + + 

Sbjct: 159 LFGWSSFEFDRFKWTCTVSWHKEISYTAFWVTWCCLLPLVAMLVCYGVIFRVARIKARKV 218 

Query: 227 YNVKRHSLEVRVKDCVXXXXXXXXXXXXXXXXXMNIPEXXXXXXXXXXXXXXXXXCYQCK 286 

Y C + QCK 
Sbjct: 219 Y CGSWVSQEESSSQNNGRKNSNTSTSSSGSRKSLIYSGS QCK 261 

Query: 287 AAKVIFI I I FSYVLSLGPYC FLAVLAVWVDVETQVPQWVIT I I IWLFFLQCC I HP YVYGY 34 6 

A I +++ +++ + GPY + + + PQ V T++ WL F HP +YG 

Sbjct: 262 AFITILWLGT FLTTWGPYVWISTEALLGKNSVSPQ-VETLVSWLSFTSAVCHPLIYGL 320 

Query: 347 MHKTIKKEIQDM 358 

+KT++KE+ M 
Sbjct: 321 WNKTVRKELLGM 332 



Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

Posted date: Jan 5, 2004 12:06 AM 
Number of letters in database: 521,258,069 
Number of sequences in database: 1,591,231 

Lambda K H 

0.328 0.139 0.442 

Gapped 

Lambda K H 

0.267 0.0410 0.140 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 
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Number of Hits to DB: 33,689,817 

Number of Sequences: 1591231 

Number of extensions: 1154639 

Number of successful extensions: 5029 

Number of sequences better than 10.0: 734 

Number of HSP's better than 10.0 without gapping: 402 

Number of HSP's successfully gapped in prelim test: 332 

Number of HSP's that attempted gapping in prelim test: 3938 

Number of HSP's gapped (non-prelim): 964 

length of query: 398 

length of database: 521,258,069 

effective HSP length: 127 

effective length of query: 271 

effective length of database: 319,171,732 

effective search space: 86495539372 

effective search space used: 86495539372 

T: 11 

A: 40 

XI: 15 ( 7.1 bits) 
X2: 38 (14.6 bits) 
X3: 64 {24.7 bits) 
SI: 40 (21.8 bits) 
S2: 74 (33.1 bits) 
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r 1: NP 473362 . G protein-coupled...[gi: 16876435] 
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LOCUS NP_473362 508 aa linear PRI 05-OCT-2003 

DEFINITION G protein-coupled receptor 101 [Homo sapiens] . 

ACCESSION NP_47 3362 

VERSION NP_473362.1 GI:16876435 

DBSOURCE REFSEQ: accession NM 054021 . 1 

KEYWORDS 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (residues 1 to 508) 

AUTHORS Takeda,S., Kadowaki, S . , Haga,T., Takaesu,H. and Mitaku,S. 

TITLE Identification of G protein-coupled receptor genes from the human 

genome sequence 
JOURNAL FEBS Lett. 520 (1-3), 97-101 (2002) 
MEDLINE 22040266 
PUBMED 12044878 
REFERENCE 2 (residues 1 to 508) 

AUTHORS Lee, D. K. , Nguyen, T., Lynch, K.R., Cheng, R., Vanti,W.B., Arkhitko,0., 

Lewis, T., Evans, J. F., George,S.R. and 0'Dowd,B.F. 
TITLE Discovery and mapping of ten novel G protein-coupled receptor genes 

JOURNAL Gene 275 (1), 83-91 (2001) 
MEDLINE 21458557 
PUBMED 11574155 

COMMENT PROVISIONAL REFSEQ : This record has not yet been subject to final 

NCBI review. The reference sequence was derived from AF411115.1 . 

Summary: G protein-coupled receptors (GPCRs, or GPRs) contain 7 
transmembrane domains and transduce extracellular signals through 
heterotrimeric G proteins . [supplied by OMIM] . 
FEATURES Location/Qualifiers 
source 1 . . 508 

/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
/chromosome="X" 
/map="Xq25-27.1" 
Protein 1..508 

/product="G protein-coupled receptor 101" 
Region 48. .240 

/region_name="7 transmembrane receptor (rhodopsin family)" 
/note="7tm_l" 
/db_xref ="CDD : pfamOOOOl " 
variation 124 

/allele="V" 
/allele="L" 

/db_xref ="dbSNP : 1190736 " 
variation 376 
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/allele="P" 
/allele= M L" 

/db_xref ="dbSNP : 5931046 " 
Region 396 . .454 

/region_name="7 transmembrane receptor (rhodopsin family)" 

/note="7tm_l" 

/db_xre f ="CDD : pfamOQOOl " 
CDS 1..508 

/gene="GPR101" 

/coded_by="NM_054021. 1: 1. .1527" 

/note="go_component : integral to membrane [goid 0016021] 
[evidence IEA] ; 

go_function: receptor activity [goid 0004872) [evidence 
IEA] ; 

go_function: rhodopsin-like receptor activity [goid 
0001584] [evidence IEA]; 

go_process: G-protein coupled receptor protein signaling 
pathway [goid 0007186] [evidence IEA] " 
/db_xref="GeneID: 83550" 
/db_xref="LocusID: 83550" 
/db xref="MIM: 300393 " 



ORIGIN 



1 mtstctnstr esnsshtcmp lskmpislah giirstvlvi flaasfvgni vlalvlqrkp 

61 qllqvtnrfi fnllvtdllq islvapwvva tsvplfwpln shfctalvsl thlfafasvn 

121 tivvvsvdry lsiihplsyp skmtqrrgyl llygtwivai lqstpplygw gqaafderna 

181 lcsmiwgasp sytilsvvsf iviplivmia cysvvfcaar rqhallynvk rhslevrvkd 

241 cvenedeega ekkeefqdes efrrqhegev kakegrmeak dgslkakegs tgtsessvea 

301 rgseevress tvasdgsmeg kegstkveen smkadkgrte vnqcsidlge ddmefgeddi 

361 nfseddveav nipeslppsr rnsnsnpplp rcyqckaakv ifiiifsyvl slgpycflav 

421 lavwvdvetq vpqwvitiii wlfflqccih pyvygymhkt ikkeiqdmlk kffckekppk 

481 edshpdlpgt eggtegkivp sydsatfp 
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Abstract 



Background: G-protein-coupled receptors (GPCRs) are the largest and most diverse family of 
transmembrane receptors. They respond to a wide range of stimuli, including small peptides, lipid 
analogs, ammo-acid derivatives, and sensory stimuli such as light, taste and odor, and transmit 
signals to the interior of the cell through interaction with heterotrimeric G proteins. A large 
number of putative GPCRs have no identified natural ligand. We hypothesized that a more 
complete knowledge of the phylogenetic relationship of these orphan receptors to receptors 
with known ligands could facilitate ligand identification, as related receptors often have ligands 
with similar structural features. 

Results: A database search excluding olfactory and gustatory receptors was used to compile a 
list of accession numbers and synonyms of 81 orphan and 196 human GPCRs with known Ugands. 
Of these, 241 sequences belonging to the rhodopsin receptor-like family A were aligned and a 
tentative phylogenetic tree constructed by neighbor joining. This tree and local alignment tools 
were used to define 19 subgroups of family A small enough for more accurate maximum- 
likelihood analyses. The secretin receptor-like family B and metabotropic gtutamate receptor-like 
family C were directly subjected to these methods. 

Conclusions: Our trees show the overall relationship of 277 GPCRs with emphasis on orphan 
receptors. Support values are given for each branch. This approach may prove valuable for 
Identification of the natural ligands of orphan receptors as their relation to receptors with known 
ligands becomes more evident. 



Background 

G-protein-coupled receptors (GPCRs) are the largest and 
most diverse family of transmembrane receptors. They 
respond to a wide range of stimuli including small peptides, 
lipid analogs, amino-acid derivatives, and sensory stimuli 
such as light, taste and odor [1], and transmit signals to the 
interior of the cell through interaction with heterotrimeric G 



proteins. Certain amino-acid residues of this receptor family 
are well conserved and approaches exploiting this, such as 
low-stringency hybridization and degenerate PCR, have been 
used to clone new members of this large superfamUy [2-4]. 
Many of these putative receptors share GPCR structural 
motifs, but still lack a defined physiologically relevant ligand. 
One strategy to identify the natural ligand of these so-called 



Genome Biology Vol 3 No II .loost and Methner 



orphan receptors uses changes in second-messenger 
activation in cells stably expressing the receptor in response 
to tissue extracts expected to contain the natural ligand [5]. 
In a second step, these extracts are tested and fractionated to 
purity, before being analyzed by mass spectrometry. This 
strategy led to the identification of several novel bioactive 
peptides or peptide families (for review see [6]). The identifi- 
cation of these natural ligands is likely to give further insight 
into the physiological role of these receptors and advance the 
design of pharmacologically active receptor agonists or 
antagonists. This is of particular interest, as GPCRs are the 
most targeted protein superfamily in pharmaceutical 
research [7]. Better prediction of the presumed chemical 
class or structure of the ligand facilitates the identification of 
orphan receptors by the strategy described above, as the 
ligand purification process can be tailored more specifically 
to the assumed class of substances. 

Phylogenetic analysis of receptor relationships has already 
been used to elucidate the chemical nature of receptor 
ligands. The identification of sphingosine l-phosphate as the 
ligand for the GPCR EDG-i led to the prediction that EDG-3, 
EDG-5, EDG-6 and EDG-8 have the same ligand [8-11]. In 
contrast, phylogenetically distinct members of the EDG 
cluster - EDG-2, EDG-4 and EDG-7 - are receptors for the 
similar but distinct ligand lysophosphatidic acid (LPA) 
[12-14]. Neuromedin U, a potent neuropeptide that causes 
contraction of smooth muscle, was correctly predicted 
phylogenetically to be the ligand of the orphan GPCR FM3 
(NMUR) [15]. Not only the ligand, but also the pharmacol- 
ogy of a novel receptor for histamine, was predicted and con- 
firmed through phylogeny [16]. GPR86, related to the ADP 
receptor P2Y12, was similarly recently shown to bind ADP 
[17], and UDP-glucose, a molecule involved in carbohydrate 
biosynthesis, was shown to be the ligand for the related 
receptor KIAA0001 [18]. 

Mammalian GPCRs were previously classified by phylogeny 
into three families [19,20]: the rhodopsin receptor-like 
family (A), the secretin receptor-like receptor family (B) and 
the metabotropic glutamate receptor family (C). These 
results were generated by neighbor joining, a fast distance- 
based method suited for large datasets, but influenced by 
methodological flaws that can in part be overcome by 
methods not generally applied previously. 

In this work, we compiled an exhaustive list that includes all 
available synonyms and accession numbers of 196 human 
GPCRs with known ligands and 84 human orphan receptors. 
The 241 sequences belonging to family A were aligned, and a 
tentative tree constructed by neighbor joining with 1,000 
bootstrap steps. Subgroups of family A defined by this tree 
and sequences from families B and C were then used for 
more accurate phylogenetic analysis by state-of-the-art tech- 
niques. From this analysis, we tried to predict possible 
ligands for orphan receptors. 



Results and discussion 

We set out to define the phylogenetic relationship of human 
GPCRs by state-of-the-art tools, assuming that the identifi- 
cation of cognate ligands of orphan receptors will be facili- 
tated by a more complete knowledge of their relationship 
within the large and diverse superfamily. 

Database mining and multiple sequence alignment 

Most receptors were identified by different groups; there- 
fore, many confusing names and synonyms exist. We 
adhered to SWISS-PROT names where possible, and com- 
piled a list including all available synonyms and accession 
numbers of 196 human GPCRs with known ligands and 84 
human orphan receptors (Table 1 shows all receptors men- 
tioned in this work; the complete list is supplied as an addi- 
tional data file with the online version of this paper). 
Gustatory and olfactory receptors were omitted. Multiple 
protein sequences were aligned and the extremely variable 
amino termini upstream of the first transmembrane domain 
and carboxyl termini downstream of the seventh transmem- 
brane domain were deleted to avoid length heterogeneity 
(see Figure 1). The deleted regions contained no significant 
sequence conservation. 

Phylogenetic analysis 

Because of the large number of sequences in family A, we 
had to use a combination of computational methods to 
accomplish the best possible description of their phyloge- 
netic relationship. In a first step we used the distance-based 
neighbor-joining method as the only one computationally 
feasible. Neighbor joining has been shown to be efficient at 
recovering the correct tree topology [21], but is greatly influ- 
enced by methodological errors, for example, the sampling 
error [22]. This can in part be overcome by bootstrapping, a 
method of testing the reliability of a dataset by the creation 
of pseudorepticate datasets by resampling. Bootstrapping 
assesses whether stochastic effects have influenced the dis- 
tribution of amino acids [23]. In previous publications on 
this topic* bootstrapping has not been generally used. 

We generated a neighbor-joining tree of family-A sequences, 
and considered tree branches to be confirmed if they were 
found in more than 500 of 1,000 bootstrap steps (Figure 2). 
Hie same branching pattern was found by least squares 
(data not shown) as implemented in FITCH [24], but it was 
not possible to compute enough bootstrap steps with the 
equipment used. The remaining sequences of unconfirmed 
branches were then assigned to existing branches according 
to results obtained with the local alignment tool BLASTP 
(see Additional data files) [25] to account for similarities in 
parts of the sequences not sufficient for repeated global 
alignment The p-value was used as a measure of similarity. 

As this strategy still left four subgroups too large for detailed 
analyses, we recalculated neighbor-joining trees and in 
some cases least-square trees of these sequences to create 
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Table I 



List of example receptor names, accession numbers and abbreviations 



Receptor Group Accession no. Names and synonyms 



Human GPCR - Family A 



ADMR 


A02 


O f 52 1 8 


Adrenomedullin receptor, Am-R 


APJ 


A03 


D1C j* 1 A 


Apelin receptor, Apj, Agtrl 1 


CMLI 


a no 

AUO 




^ncrnuKine rccepior*tiKe i, uez, Lncmnj, Vvii&j, v*riiKiri 




Am 


nootn 


^nemoKinc recepior-iiKe x, nuw-iiiuuceu ciiuuiiicnai w prwiGiii"\.uupic(i receptor, 
Feg-I, Gpr30, Cmkrl2, Dry 12, Cepr 


DUFF 


A02 


Q 16570 


Duffy antigen, Fy glycoprotein, glycoprotein D, Gpfy, Fy, Gpd, Dare 


EDG 1 


AI3 


P2I453 


Endothelial differentiation, Sphingosine 1 -phosphate receptor, Lp-B t 


EDG2 


AI3 


Q92633 


Endothelial differentiation, lysophosphatidic acid receptor, Lp-A 1 , Vzg- 1 


EDG3 


AI3 


Q 99500 


Endothelial differentiation, lysosphingolipid receptor, Lp-83 




A13 


NM 004720 


Endothelial differentiation, lysophosphatidic acid receptor, Lp-A2 


EDG5 


AI3 


NP 004221 


Endothelial differentiation, sphingolipid receptor. Lp-B2, H2I8. Agrl6 


EDG6 


AI3 


AI000479 


Endothelial differentiation, lysosphingolipid receptor, Lp-CI 




AI3 


NP 036284 


Endothelial differentiation, lysophosphatidic acid receptor, Lp-A3 


EOG8 


AI3 


NP 110387 


Endothelial differentiation, sphingosine 1 -phosphate receptor, Lp-B4 


ETRR.LP2 


A07 


Y 16280 


Endothelin B receptor-like proteln-2, Etbrtp2, Ebp2, Cns2 


FSHR 


AIO 


P23945 


Follicle stimulating hormone receptor, Fsh-R, folli tropin receptor 


GPR 


A06 


NM 007223 


G protein-coupled receptor 


APR 1 
urnl 


AAD 
MVO 


P4609I 


G r»rnfrf»i n.r r»unl*»rl receotor Gorl 

UI VtCIII% VUIIICU I CtC LI Vw 1 V 


urnj 


AI3 


P46089 


G protein -coupled receptor, Acca orphan receptor 


GPR6 


Al 3 


P46095 


G protein-coupled receptor 6 


rpDT 


A04 


P48I45 


G protein-coupled receptor 7 


/■DDD 

urno 


Aft A 

AV*t 


P48I46 


G nrnfpln>f oudIm! rpccotor 8 

III Vivlll^VUUICv ■ VVVVivl W 


/•DD1C 




mm an c 70ft 


G nmft>in«rnimlpH f#»f Ant"nr 25 


UrRI/ 


A 1 tt 




\3 proiein-cvuijjieu rcvcpiur x » , <jicu i 


urKJ4 


A 1 *> 
Al A 


mm nncinfl 


vj pro(cin*coupica rcvcpwir, vipr y 


<"DD1C 

UrKJ> 


AIC 
Al 9 


mm nncmi 


nrnt'oln.^M ml A/i raranfnr ^ Q 

u protdn-toupicu retepiwr jj 


Cj PRJ7 


AU / 


mm nnnm 






A AT 

AO/ 


Oil I 04 


\j procein-coupieu receptor \xpr<*7 


wrMU 


A 1 1 

Al 1 




w proietn-coupiea rtsttipiur \jpr*tw 




All 

Al 1 




w procein-cuupieo rtvepvur vjpr** i , n>a a n 


GPR42 


All 

Al 1 




w protein-coupieo receptor vjpr*?* 


GPR43 


Al 1 


ai et 
OIS552 


G protein^coupled receptor Gpr43 


GPR44 


A08 


A A 1 Arc 

AAD2I055 


G protein-coupled receptor 44 


GPR44 


AOS 


A A 1 AC F 


G protein-coupled receptor 44 


GPR46 


AIO 


NM_0 1 8490 


G protein-coupled receptor 48 


GPR49 


AIO 


NM.003667 


G protein-coupled receptor 49, Hg38, G protein-coupled receptor 67, Fex 


GPR52 


AI8 


Q9Y2T5 


G protein-coupled receptor GprS2 


GPR55 


AI5 


NM.005683 


G protein-coupled receptor 55 


GPRS 7 


AI7 


NM_0 14627 


G protein-coupled receptor 57 


GPR58 


AI7 


NM_0 14626 


G protein-coupled receptor 56 


GPR6I 


AI8 


AF3 17652 


G protein-coupled receptor 6 1 


GPR62 


AI8 


AF3 17653 


G protein-coupled receptor 62 


GPR63 


AI8 


AF3 17654 


G protein-coupled receptor 63 


GPR72 


A09 


NM.0 16540 


G protein-coupled receptor 72, Jp05 


GPR73 


A09 


AAE24084 


G protein-coupled receptor 73 


GPR7S 


A09 


NM.006794 


G protein-coupled receptor 75 


GPR80 


All 


AF4III09 


G protein-coupled receptor 80 


GPR6I 


All 


AF4IIM0 


G protein-coupled receptor 8 1 


GPR85 


AI8 


NM.0 18970 


G protein-coupled receptor 85, Sreb2 


GPR86 


AI2 


NP.0764O3 


Adp receptor 


GPR67 


AI2 


NM.0239I5 


G protein-coupled receptor 87 


GPR86 


AI8 


NM.022049 


G protein-coupled receptor 88 


GPR9I 


All 


NM_03J050 


G protein-coupled receptor 9 1 
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Table 1 (continued) 


Receptor 


Group 


Accession No. 


Names & Synonyms 


GPRI01 


AI8 


NM.0S402I 


G protein-coupled receptor 101 


GPRI02 


AI7 


NMJJS3278 


G protein-coupled receptor 102 


GPR103 


A06 


AF4 11117 


G protein-coupled receptor 103 


GPRC 


AI3 


P47775 


Gprl2 


GPRF 


A03 


P49685 


GprlS, Bob 


GPRJ 


A09 


QI5760 


Gprl9, Gpr-Nga 


GPRL 


AI8 


Q99679 


Gpr2l 


GPRM 


A06 


Q99680 


Gpr22 


GPRV 


All 


000270 


Gpr3l 


CPRW 


A08 


075388 


Gpr32 


HM74 


All 


P490I9 


G protein-coupled receptor Hm74 


KI01 


AI2 


QI539I 


Udp-Glucose receptor. KiaaOOOl 


LSHR 


AtO 


P22888 


Lutropln-choriogonadotropic hormone receptor, Lh/Cg-R. Lsh-R, luteinizing hormone receptor, 








Lhcgr, Lhrhr, Lcgr 


MAS 


A08 


P0420I 


Mas proto-oncogene, Masl 


ML! A 


A09 


P48039 


Melatonin receptor Type la, Mel-la-R. Mtnrla 


ML1B 


A09 


P49286 


Melatonin receptor Type lb, Mel-lb-R, Mtnrl b 


MLIX 


A09 


QI3585 


Melaton In-related receptor, H9 t GprSO 


MRG 


A08 


P354I0 


Mas-related G protein-coupled receptor 


NMUIR 


A07 


AF272362 


Neuromedin U receptor 1. Nmurl, Gpr66, Fm-3 


NTR1 


A07 


P30989 


Neurotensin receptor Type 1. Nt-R-I. Ntsrl, Ntrr 


NTR2 


A07 


095665 


Neurotensin receptor Type 2, Nt-R-2, levocabastine-sensitive neurotensin receptor, Ntr2 








receptor, Ntsr2 


NYIR 


A09 


P25929 


Neuropeptide Y receptor Type 1 , Npyl-R, Npylr, Npyr, Npyyl 


NY2R 


A09 


P49I46 


Neuropeptide Y receptor Type 2, Npy2-R, Npy2r 


NY4R 


A09 


P5039I 


Neuropeptide Y receptor Type 4, Npy4-R. Pancreatic Polypeptide receptor 1, Ppl, Ppyrl. 


Npy4r 








P2Y5 


AI5 


P43657 


P2y purinoceptor 5, P2y5 t purinergic receptor 5, P2ry5, 6hl 


P2Y7 


AOS 


Q 15722 


P2y purinoceptor 7, P2y7, Leukotriene B4 receptor, Chemoattractant receptor-like 1 . 








P2ry7. P2y7, Gprl6. CmkHI, Ltb4r 


P2Y9 


AI5 


Q99677 


P2y purinoceptor 9, P2y9 f purinergic receptor 9 ( Gpr23, P2ry9 


P2YI0 


AI5 




Putative purinergic receptor P2yl0 


P2YI2 


AI2 


AF3 13449 


Adp receptor. Spl999 


PAFR 


AI2 


P25I05 


Platelet Activating Factor receptor, Paf-R, Ptafr 


PNR 


AI7 


AF02I8I8 


Putatfve neurotransmitter receptor 


PSP24 


AI8 


U92642 


High-affinity lysopfiosphatldic add receptor homoiog, Gpr45 


RDCI 


A02 


P25I06 


G protein-coupled receptor Rdcl homoiog 


RE2 


AI8 


AF09I890 


G protein-coupled receptor Re2 


SALPR 


A05 


NM_0 16568 


Somatostatin and angiotensln-flke peptide receptor, LocS 1 289 


SREB3 


AI8 


NM_0 18969 


Super conserved receptor expressed in brain 3 


TM7SFI 


A01 


AF027826 


Putative seven pass transmembrane protein 


TSHR 


AIO 


PI 6473 


I nyrotd somuiating normone receptor, thyrotropin receptor , i sn-i\ 


Human GPCR - 


Family B 






EMRI 


B 


Q 14246 


Cell surface glycoprotein emrl, Emrl hormone receptor 


EMR2 


B 


AFI 14491 


Egf-Mce module Emr2 


EMR3 


B 


AF239764 


EgMfke module-containing mucln-lilce receptor Emr3 


BAIt 


B 


OI45I4 


Brain-spech1c anglogenests Inhibitor 1 


6AI2 


B 


O6024I 


Brain-specific anglogenests Inhibitor 2 


BAI3 


B 


060242 


Brain-specific angiogenesls Inhibitor 3, KlaaOSSO 


GPRS6 


B 


NM.005682 


G protein-coupled receptor 56 


Human GPCR * 


Family C 






GPRC5B 


C 


NM.0I623S 


G PROTEIN-COUPLED RECEPTOR. FAMILY C GROUP 5, MEMBER ft, GPRC5B 


GPRCSC 


c 


NM.0 18653 


G protein-coupled receptor, family C t group 5 f member C, Gprc5c 


GPRC50 


c 


NH.0I86S4 


G protein-coupled receptor, family C, group 5, member D, Gprc5d 



A complete list b supplied as additional data file. Orphan receptors arc shown in bold. 
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subgroups Ai and 2, A4 and 5, An and 15 and A17 and 18. 
This approach finally resulted in 19 differently sized sub- 
groups of family A (Table 2) that were further subjected to 
the more reliable maximum-likelihood and quartet-puzzling 



algorithms. Maximum-likelihood approaches calculate the 
probability of the observed data assuming that it has evolved 
in accordance with a chosen evolutionary model. Phytoge- 
nies are then inferred by finding trees and parameters that 




Figure I 

An example multiple sequence alignment of seven receptors. Protein sequences of GPR87, KIOI . GPR86. P2YI2. H963. GPR34 and PAFR belonging to 
subgroup 12 were aligned with ClustalX and modified by deleting the extremely variable amino termini upstream of the first transmembrane domain and 
carboxyt termini downstream of the seventh transmembrane domain as Indicated. Identical amino-acid residues in all aligned sequences are shaded In 
bbek and similar residues m gray. Transmembrane (TM) domains Identified by the TMpred program are indicated 
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rigurc x 

Neighbor-joining tret of the rhodopsm receptor-like family A Inferred from the multiple sequence alignment using PHYUP 3.6. Support values for each 
internal branch were obtained by 1 .000 bootstrap steps, and are indicated. Pilrwise distances were determined with PROTDIST and the JTT substitution 
frequency matrbc The tree was calculated with NEIGHBOR using standard parameters and rooted with the distant, though related, fomity-B receptor 
GPRCSB as the outgroup. The consensus tree of all bootstrapped sequences was obtained with CONSENSU Orphan receptors are shown In bold. Scale 
bar Indicates the branch length of 100 substitutions per site. 
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Table 2 



Receptor subgroups derived from a combination of neighbor-joining and BLASTP results 



Al 


A2 


A3 


A4 


A5 


A6 


A7 


A8 


A9 


AIO 


All 


C3XI 


ADMR 


AG22 


GPR7 


GALR 


rr 1 K 




C3AR 


GPR72 


FSHR 


GPR40 


CKRI 


BONZO 


AG2R 


GPR8 


GALS 


FF2R 


CT 1 Q 
L 1 1 ft 


C5AR 


GPR73 


GPR48 


GPR4I 


CKR2 


CCRM 


AG2S 


OPRD 


GALT 


GASR 


CTDD 




(GPR75) 


GPR49 


GPR42 


CKR3 


CCR3 


APj 


OPRK 


GPR54 


(GPR) 


CTDD 1 DT 


C M t 1 


GPRA 


LSHR 


GPR43 


CKR4 


CCR4 


BRBI 


OPRM 


/— DD f> 


rpp 1 a? 


GHSR 


FMLI 


GPRJ 


TSHR 


GPR80 




ecu. ^ 


BRB2 


OPRX 


P2Y7 


(GPRM) 


GPR37 


FML2 


MLIA 




GPR81 


CKR8 


CKR6 


GPR2S 


SSRI 


SALPR 


GRHR 


GPR38 


FMLR 


ML IB 




GPR82 


CKRX 


CKR7 


GPRF 


SSR2 


UR2R 


OXIR 


GPR39 


GPRI 


MUX 




GPR9I 


CXCl 


CKR9 




SSR3 




OX2R 


GRPR 


GPR44 


NK.IK 




uri\T 


(TM7SFI) 


CKRA 




SSR4 




OXYR 


NMBR 


GPRW 


NK2R 








CML2 




SSR5 




VIAR 


NMUIR 


(MAS) 


NK3R 




DTI ID 
rZUK 




(DUFF) 








VIBR 


NMU2R 


(MRG) 


NK4R 




rZTI 1 




IL8A 








V2R 


NTRI 




NYIR 




D0Y4 




IL86 










NTR2 




KIVTD 

NTZK 




rX 1 O 




RDCI 










TRFR 




NY4R 




P0YR 


















NY5R 






AI2 


AI3 


AI4 


AIS 


AI6 


AI7 


AI8 


AI9 




B 


c 


GPR34 


ACTR 


PD2R 


EBI2 


OPSB 


5H2A 


* A ID 

AAl K 






BAD 


CASR 


GPR86 


CBIR 


PE2I 


G2A 


OPSD 


5H2B 


PJKLfK 


SHI B 




BA12 


GBRI 


GPR87 


CB2R 


PE22 


GPR35 


OPSG 


5H2C 


A AID 

AAzb 


i n 

3n 1 U 




BAI3 


GBR2 


H963 


EDGt 


PE23 


GPR4 


OPSR 


5H6 


A AID 


SHIE 




CALR 


GPRC5B 


KIOI 


EDG2 


PE24 


GPR55 


OPSX 


Al AA 




5H IF 




CD97 


GPRC5C 


P2YI2 


EDG3 


PF2R 


GPR6S 


RGR 


Al AB 


ACM") 


5H5A 




CGRR 


GPRC5D 


PAFR 


E0G4 


PGR 


GPR68 




AIAD 




5H7 




CRFI 


MGRI 




E0G5 


TA2R 


GPR92 




A2AA 


ACM4 






CRF2 


MGR2 




EDG6 




GPRH 




A2AB 


ACru 






EMRI 


MGR3 




EDG7 




GPRI 




A2AC 


GPRI 01 






CMR1 

cniu 


MGR4 




EDG8 




GPRK 




A2AD 


GPR27 






CMR1 


MGR5 




GPR3 




P2YI0 




BIAR 


GPR52 






vjlr?\ 


MGR6 




GPR6 




P2Y5 




B2AR 


GPR6I 






fit PR 


MGR7 




urKL 




P2Y9 




B3AR 


GPR62 






GLR 


MGR8 




MC3R 




PAR2 




D2DR 


GPR63 






GPL2 






MC4R 




PAR3 




D3DR 


GPR78 






GPR56 






MC5R 




THRR 




D4DR 


GPR84 






GRFR 






MSHR 








DADR 


GPR85 






PACR 














DBDR 


(GPR88) 






PTR2 














GPRI 02 


GPRL 






PTRR 














GPR57 


HHIR 






SCRC 














GPR58 


PSP24 






VIPR 














HH2R 


RE2 






VIPS 














PNR 


SREB3 











Very distantly related receptors that ire possibly not phytegenetkally related are shown in brackets. Orphan receptors are shown In bold. 
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yield the highest likelihood. Maximum-likelihood approaches 
tend to outperform alternative methods such as parsimony 
or distance-based methods. The main advantage is the appli- 
cation of a well defined model of sequence evolution to a 
given datasel [26]. Maximum likelihood is the estimation 
method least affected by sampling error and tends to be 
robust to many violations of the assumptions in the evolu- 
tionary model. The methods are statistically well founded, 
evaluate different tree topologies and use all sequence infor- 
mation available [27,28]. Because of their smaller size, fami- 
lies B and C could be subjected to these methods without 
prior subgrouping. This resulted in 19 phylogenetic trees, 
comprising 241 receptors for family A (Figures 3-6), one 
tree from 23 sequences for family B and one tree from 14 
sequences for family C (Figure 7). Family-A trees were 
rooted with the human family-B receptor GPRC5B and fam- 
ilies B and C with family- A receptor 5H1A. The sequence 
used to root the tree (the outgroup) is supposed to be a 
distant, though related, sequence. In some of our groups, 
the phylogenetic trees could not be fully resolved. This could 
be due to either very similar or very distant sequences. In 
both cases the phylogenetic signal is too weak to resolve the 
tree [29]. Several receptors (for example, TM7SF1, DUFF, 
GPR, GPRM, GPR75, GPR88, MAS and MRG) were found to 
be only distantly related to other known receptors used in 




Figure 3 

Chemokine receptors (subgroups Al and A2). Phylogenetic trees of the 
subgroups were inferred using Puzzle 5.0 corrected by the JTT 
substitution frequency matrix. Quartet-puzzling support percentage 
values from 10,000 puzzling steps are shown. The scale bars indicate a 
maximum likelihood branch length of 0.1 inferred substitutions per site. 
Orphan receptors are shaded 



our analysis. A possible explanation could be the previously 
proposed convergent evolution of this large protein family, 
meaning that these receptors have acquired the compelling 
similarity in their overall structures as a result of functional 
need, not phylogenetic relationship. The lack of significant 
sequence similarity among the different GPCR families 
favors this assumption [30-32]. Other explanations for the 
lack of significant sequence similarities might be an extra- 
ordinary divergence (genetic drift) or technical problems of 
the sequence-analysis methods used in analyzing polytopic 
membrane proteins or large protein families [33]. 

Receptor family A subgroups 

In contrast to the subfamilies presented in GPCRDB [34], a 
database widely used in the field, our grouping shows the 
orphan receptors within their respective subgroup and their 
relationship to receptors with known ligands. In addition, 
our method sometimes resulted in subgroups with members 
whose ligands belong to different substance classes. These 
results are discussed in more detail below. 

Chemokine receptors 

Groups Ai and A2 comprise the chemokine receptors 
(Figure 3). The chemokine ligand superfamily is defined by 
four conserved cysteines that form two disulfide bonds, and 
can be structurally subdivided into two major branches 
based on the spacing of the first cysteine pair. Chemokines 
in which these residues are adjacent form the CC subfamily 
(corresponding to the SWISS-PROT CKR nomenclature 
used here), and those separated by a single amino acid com- 
prise the CXC subfamily (here CCR and IL8R; for a review 
see [35]). We had to divide the whole subfamily into two 
groups to perform a detailed phylogenetic analysis. This sub- 
grouping produced the same dichotomy, as suggested by the 
two-ligand motifs, as another example of the parallel evolu- 
tion of receptors and ligands. Similar results describing this 
parallel evolution were found previously using a different 
computational approach [36]. 

Group Al mainly comprises the CC family. We hypothesize 
that the orphan receptor CKRX, which constitutes a separate 
branch related to CKRi, 2, 3 and 5, might also bind a CC 
ligand. In contrast, TM7SF1 in this group seems to be only 
distantly, if at all, related to family-A receptors. It was 
grouped according to BLASTP results, where a misleading 
local alignment of approximately 20 amino acids placed it in 
the vicinity of the chemokine receptors. Group A2 is more 
heterogeneous and comprises receptors for CC and CXC 
ligands, as well as an orphan receptor (ADMR) previously 
thought to bind the peptide adrenomedullin. 
Adrenomedullin has now been shown to bind a family-B 
receptor and is discussed further below. The orphan receptor 
RDCi in group A2 was first believed to be a receptor for 
vasointestinal peptide VIP [37], a notion not supported by 
phytogeny and later dismissed by experimental data [38]. 
Our results place it closer to the ADMR receptor than to the 
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Peptide receptors (subgroups A3-AI0). The scale bar Indicates a maximum likelihood branch length of 0.1 inferred substitutions per site. Orphan 
receptors are shaded. For method see Figure 2. 
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Figure 5 

Nucleotide and lipid receptors (subgroups A 1 1 -A 1 6). The scale bar indicates a maximum-likelihood branch length of 0.1 inferred substitutions per site. 
Orphan receptors are shaded. For method see Figure 2. 



typical chemokine receptors. CML2 is a typical, but distant, 
member of the chemokine receptor family. Hie DUFF recep- 
tor (the Duffy antigen) is also very distantly related and was 
only grouped into A2 by BIASTP results. 

Peptide receptors 

Group A3 consists of receptors for the small peptides 
angiotensin (8 amino acids), bradykinin (9 amino acids) and 
apelin (Figure 4). Four forms of apelin (12, 13, 17 and 36 
amino adds) have been described, but only those of 12 and 13 
amino acids bind in nanomolar concentrations [39]. The 
orphan receptors GPRF and GPR25 in this group are related 
as closely to the apelin receptor APJ as to the angiotensin or 
bradykinin receptors, and might also bind small peptides. 
GPRF acts as a co-receptor for the human immunodeficiency 
virus (HIV) [40], like the APJ receptor [41], which further 
hints at structural homology of the two ligands. Opioid and 
somatostatin receptors make up group A4- Both somatostatin 
and opioid peptides are derived from the processing of larger 
precursors. The somatostatins are cyclic peptides of 14 and 



28 amino adds. The opioid precursors preproenkephalin, 
preprodynorphin, prepro-opiomelanocortin and prepronod- 
ceptin display a strikingly similar general organization and a 
conserved arnino-terminal region that contains six cysteines, 
probably involved in disulfide bond formation. 

The processed neuropeptides, in contrast, are less similar to 
each other. It could be speculated that the receptors first 
bound the precursors themselves, and that the diversity 
derived from processing is evolutionarUy new. Processing 
prepronodceptin gives rise to two evolutionarily conserved 
peptides besides orphanin FQ, the ligand for OPRX. It has 
not been reported whether these peptides bind to the orphan 
receptors GPR7 and GPR8, which constitute a new branch 
related to the opioid receptors. 

In group A5 we find three receptors that bind the 30-amino- 
add peptide galanin, and related to these the GPR54 recep- 
tor, which is activated by the 54-* 14*. and 13-amino-acid 
peptides derived from the product of KiSS-i, a metastasis 
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suppressor gene for melanoma cells. These kisspeptins all 
share a common RF-amide caboxyl terminus. Although only 
distantly related to each other, both GPRO (melanin-concen- 
trating hormone) and UR2R (urotensin II peptide) bind 
cyclic peptides originally isolated from fish. Similarly distant 
is the orphan receptor SALPR, which shares sequence simi- 
larity with somatostatin (A4) and angiotensin (A3) recep- 
tors, but subgrouping of groups A4 and 5 by neighbor 
joining led to its placement in group 5. SALPR does not bind 
somatostatin or angiotensin ligands [42], but could bind 
another cyclic peptide. The P2Y7 receptor in group A5 does 
not bind nucleotides [43], as suggested by the name, but was 
published as a receptor for the lipid leukotriene B4 [44], a 
notion not supported by phytogeny. In addition, two new 
leukotriene receptors - CLTi and CLT2 - have been cloned 
and characterized during the preparation of this manuscript 
[45,46] and were found to be unrelated to P2Y7. 

Group A6 is again composed solely of receptors for peptide 
ligands. The orphan receptor GPR103 is related to the 
neuropeptide FF receptors that bind two amidated mam- 
malian neuropeptides - NPAF (A-18-F-araide) and NPFF (F- 
8-F-amide), also known as morphine-modulating peptides. 
These peptides, which may also be the ligand for GPR103, 
are members of a large family of neuropeptides related to the 
molluscan cardioexcitatory neuropeptide (FMRF-amide, 
Phe-Met-Arg-Phe-amide). The orphan receptors GPRM and 
GPR in group A6 are most probably also peptide receptors, 
but are only very distantly related to the others and show no 



relationship to receptors with known ligands. Group A7 is 
also composed of receptors for peptide ligands: neuromedin, 
neurotensin, motilin, endothelin, bombesin and the releas- 
ing hormones for growth hormone and thyrotropin. GPR39 
might bind a small peptide ligand like the closely related 
neurotensin receptors NTRi and 2, which binds a 13-amino- 
acid peptide derived from a larger precursor protein. GPR37 
and ETBR-LP2 are related to each other and branch off the 
endothelin receptors that bind characteristic bicyclic pep- 
tides of 21 amino acids containing four cysteines linked by 
two disulfide bonds. 

Group A8 has two branches with receptors with known 
ligands. These receptors bind the structurally diverse but 
functionally related chemotactic substances N-formyl- 
methionyi and the anaphylatoxic complement factors. The 
N-formylmethionyl ligands are small hydrophilic peptides of 
bacterial origin, but recently a number of new peptide ago- 
nists have been identified that selectively activate the high- 
affinity fMLF receptor FPR and/or its low-affinity variant 
FPRU. These agonists include peptide domains derived 
from the envelope proteins of HIV type 1 and at least three 
amyloidogenic polypeptides, the human acute-phase protein 
serum amyloid A, the 42-amino-acid form of beta-amyloid 
peptide and a 21-amino-acid fragment of the human prion 
protein. Furthermore, a cleavage fragment of neutrophil 
granule-derived bactericidal cathelicidin, LL-37, is also a 
chemotactic agonist for FPRLi (for a review see [47])- The 
complement factors C3a and Csa are large but highly 
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Figure 7 

Families B and C of the G -protein -coupled receptors (GPRCs). Phyiogenetic trees of families B and C were inferred using Puzzle 5.0 corrected by the 
JTT substitution frequency matrix. Quartet-puzzling support percentage values from 1 0.000 puzzling steps are shown. The scale bar indicates a maximum 
likelihood branch length of 0. 1 inferred substitutions per site. Orphan receptors are shaded. 



hydrophilic proteins with a mainly alpha-helical structure 
held together by three disulfide bridges. Csa is rapidly 
desarginated to the less potent derivative Q>adR74, which is 
the ligand for the C5L2 receptor. The orphan receptors 
GPRi, CMLi and GPR44 all cluster, and constitute a sepa- 
rate branch as distant as the other two branches. No predic- 
tion of the possible structure of the ligands for these 
receptors can be derived from this tree, but maybe they will 
function as chemotactic peptides. This could at least hint at 
leukocytes or inflamed tissue as a possible source for these 
ligands. Hie receptor GPRW constitutes its own branch, not 
as distant to the main group as the MAS oncogene product 
and the related receptor MRG, which are only very distantly 
related to the group. 

All receptors in group A9 with known ligands bind peptides, 
except for a side branch consisting of receptors for the bio- 
genic amine melatonin. The orphan receptor MLiX is closely 
related to melatonin receptors MLiA and B, but apparently 
does not bind melatonin [48]. GPR73 is related to the neuro- 
peptide Y (NPY) receptor NY2R which mainly binds the pan- 
creatic peptide YY of 36 amino acids, and these two are 
placed together on a branch distinct from the NPY receptors 
NY4R and NYiR. GPR73 does not bind the NPY ligand 
family [49]. but possibly a similar large peptide ligand. 
The orphan receptors GPR72 and GPRJ constitute a new 



subgroup that most probably bind related peptide ligands. 
GPR72 does not bind a NPY ligand [49]. GPR75 is only very 
distantly related to the whole A9 group. The receptors for the 
glycoprotein hormones thyroid-stimulating hormone (TSH), 
luteinizing hormone (LSH) and follicle-stimulating hormone 
(FSH) make up Group A10. GPR48 and 49 are very similar 
in their overall structure, with long amino termini, but their 
relationship is also evident in the neighbor-joining tree con- 
structed from alignments without amino and carboxyl 
termini. It has been recently shown that these receptors 
mediate the action of relaxin, a peptide hormone of the 
insulin-like growth factor family secreted by the corpus 
luteum during pregnancy [50]. 

Nucleotide and Bpid receptors 

The receptors with known ligands in group An are the P2Y 
receptors, which bind pyrimidine as well as purine nucleotides 
(Figure 5). Several orphan receptors constitute new clusters. 
GPR80 and GPR91 are distantly related to each other and rel- 
atively close to the P2Y receptors. GPR80 is the closest relative 
of the newly identified CLT2 receptor for leukotrienes as 
judged by BLASTP results. GPR81, HM74 and GPRV and GPR 
40-43 belong to branches only distantly related to P2Y recep- 
tors. Within these potential new subfamilies, GPR41-43, 
GPR81 and HM74 are more closely related to each other than 
to GPR40 (for GPR41-43) and GPRV (for GPR81 and HM74). 
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In group A12, the platelet-activated receptor, a lipid receptor 
and receptors activated by nucleotides mingle, but are found 
on different side branches. The orphan receptor GPR87 is 
closely related to the receptor for UDP-glucose KI01 and to 
the ADP-binding receptors P2Y12 and GPR86. We assume 
that this receptor might also bind UDP-glucose or another 
modified nucleotide. GPR34 is distantly related to the 
platelet-activating factor (PAF) receptor; it was not activated 
by available lipid ligands [51], but might nevertheless bind a 
lipid ligand. Group A13 contains both peptide and lipid 
receptors but they make up different branches. The peptide 
branch binds peptides derived from the processing of pro- 
opiomelanocortin that gives rise to peptides of between 12 
and 36 amino acids. The EDG and cannabinoid receptors 
constitute clusters, and one cluster distinct from the other 
three consists of the orphan receptors GPR3, GPR6 and 
GPRC, which have been grouped closer to the lipid EDG 
receptors in the overall neighbor-joining tree (Figure 2). 
This information helped to identify a phospholipid ligand for 
GPRC (H. Chica Schaller, personal communication). 

The receptors in group A14 all bind ligands derived from 
arachidonic acid by the action of cyclooxygenase. These 
receptors for lipid-derived autacoids or prostanoids com- 
prise receptors for the prostaglandins and thromboxanes. 
There are no orphan receptors in this group. Group A15 is a 
very heterogenous group composed of receptors for the 
lipids sphingosylphosphorylcholine (SPC), lysophos- 
phatidylcholine (LPC) and psychosine, and receptors acti- 
vated by proteases. GPR4 and GPR68 both bind SPC, like 
the EDG receptor branch consisting of the EDGi, 3, 6 and 
8 receptors in A13, but are not closely related. Protease- 
activated receptors become activated by a part of the 
former amino terminus cleaved by the protease. The new 
amino terminus then functions as a tethered ligand and 
activates the receptor. This can be mimicked by very small 
peptides derived from this ligand; such receptors should 
therefore rather resemble peptide receptors. The orphans 
P2Y5, P2Y9 and P2Y10 receptors were not placed in group 
11 and 12 like most P2Y receptors, but in group A15, sup- 
porting the fact that they were misnamed. P2Y5 and P2Y9 
do not bind nucleotides [52,53], *> ut this has not been 
shown yet for P2Y10. All other orphan receptors in this 
group, with the exception of GPR35 and GPR55 which 
cluster together, are as distantly related to each other as to 
the receptors with known ligands. Group A16 contains the 
opsins, receptors that are activated by isoprenoid ligands, 
and no orphan receptors. 

Biogenic amine receptors 

Some serotonin receptors and receptors for the biogenic 
amines adrenaline, dopamine and histamine are all placed 
on different branches in group A17 (Figure 6). An additional 
branch consists of the orphan receptors GPR102, PNR, 
GPR57 and GPR58, which are as distantly related to the 
others as, for example, is the alpha-adrenergic receptor 



branch. PNR and GPR58 expressed in COS cells did not bind 
various serotonin receptor-specific ligands [54]. Their 
ligands might be small molecules with similar properties. 
Group A18 is very heterogeneous and consists of receptors 
for the biogenic amines acetylcholine and adenosine, and the 
HHiR receptor for histamine, as well as many distantly 
related orphan GPCRs. GPR63 is closely related to the 
orphan receptor PSP24. The Xenopus laevis homolog of this 
receptor binds LPA [55]. GPR101 and RE2, GPRL and 
GPR52, and GPR61 and GPR62 constitute their own sub- 
groups. In particular, the SREB1-3 cluster (GPR85, GPR27 
and SREB3) makes up its own family, with only a distant 
relationship to other GPCRs in this group. No orphan recep- 
tors are found in group A19, which consists entirely of sero- 
tonin receptors distinct from those in A 17. 

During the preparation of this manuscript several new 
family-A receptors that could not be fitted into our analysis 
were identified. These comprise 15 new receptors distinct 
from the classical biogenic amine receptors that apparently 
bind the trace amines tyramine, (i-phenylethylamine, trypta- 
mine and octopamine [56]. In addition, a new subfamily of 
GPCRs related to the mas oncogene and uniquely expressed 
in small nociceptive sensory neurons were shown to be the 
receptors for a number of enkephalin fragments [57]. 

Receptor families B and C 

Family B (Figure 7) was named after the secretin receptor. 
Yet proteins showing homology to this receptor make up 
only one of four distantly related subgroups. The receptors 
EMRi, EMR2 and EMR3, and the CD97 surface antigen, all 
have several epidermal growth factor (EGF)-like domains in 
the extracellular amino terminus. They constitute their own 
cluster only distantly related to the rest of the family. The 
same applies to the brain-specific angiogenesis inhibitor 
family BAI1-3. GPR56 was assigned to family B because it 
shows the typical signature [58], but is so far the only one of 
its kind. So for no non-protein ligand has been identified as 
a ligand for femily-B receptors. Astonishingly, one family-B 
receptor, namely the CGRP receptor, requires coexpression 
with single transmembrane receptor activity-modifying 
proteins (RAMP1-3) for ligand binding and signal transduc- 
tion [59]. Coexpression of different RAMPs results in 
binding of different cyclic peptide ligands such as 
adrenomedullin, amylin or the calcitonin gene-related 
peptide (for a review see [60]). This could further compli- 
cate the identification of the cognate ligands for these 
family-B orphan receptors, but we assume that they will 
also bind large peptide ligands. In family C (Figure 7)» the 
metabotropic glutamate receptors MGR1-8 bind the small 
molecule glutamate, the CASR receptor senses extracellular 
calcium concentration, and receptors GBR1-2 bind the 
small molecule gamma-amino butyric acid (GABA). 
GPRC5B, C and D constitute their own subgroup with no 
closer relationship to the other members, but might also 
bind small molecules. 
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Conclusions 

In this work, we calculated the phylogenetic distances of 277 
human GPCRs and show the relationship of orphan recep- 
tors to receptors for known ligands with support values for 
each branch. We then grouped orphan receptors and recep- 
tors with known ligands into 19 subgroups that sometimes 
differ from previous classifications. Three subgroups are 
composed of receptors for ligands that belong to different 
substance classes; for example, in group A12, lipid receptors 
and receptors activated by nucleotides mingle, and in groups 
A13 and A15, peptide and lipid receptors. In both subgroups 
the receptors binding ligands of different substance classes 
make up different branches. We hope that this approach 
proves valuable for identifying the natural ligands of orphan 
receptors, as related receptors have previously been shown 
to have ligands with similar structural features. 

Materials and methods 
Sequence database mining 

A database search excluding olfactory and gustatory recep- 
tors identified the amino-acid sequences of 281 human 
GPCRs. Only sequences annotated as GPCRs in the following 
databases were used: NCBI [61], SWISS-PROT [62], EMBL 
[63] and GPCRDB [34,64]. Receptors without published 
ligands in PubMed [65] were defined as orphan GPCRs. 

Multiple sequence alignments 

Multiple protein sequences were aligned with ClustalX 1.81 
[66]. Pairwise alignment parameters were set as: slow/accu- 
rate alignment; gap opening penalty to; gap extension 
penalty 0.10; protein weight matrix BLOSUM 30. Multiple 
alignment parameters were set as: gap opening penalty 10; 
gap extension penalty 0.05; delay divergent sequences 35%; 
protein weight matrix BLOSUM series [67]. The alignments 
were modified by deleting the extremely variable amino 
termini upstream of the first transmembrane domain and 
carboxyl termini downstream of the seventh transmembrane 
domain. Alignment editing and shading was done using 
BioEdit Sequence Alignment Editor [68] and GeneDoc Mul- 
tiple Sequence Alignment Editor [69]. Transmembrane 
domains were identified using the TMpred program [70] 
and, where available, data from the original publication [71]. 

Clustering of subgroups 

An overall phylogenetic tree of family A was inferred from 
the multiple sequence alignment with PHYLIP 3.6 [72]. 
Bootstrapping was performed 1,000 times using SEQBOOT 
to obtain support values for each internal branch. Pairwise 
distances were determined with PROTDIST and the JTT 
substitution frequency matrix [73]. Neighbor-joining phylo- 
genetic trees [21] were calculated with NEIGHBOR using 
standard parameters. The human GPRC5B receptor belong- 
ing to family B was used as outgroup for family A. The out- 
group sequence is supposed to be a distant, though related, 
sequence and is used to root the tree. The majority-rule 



consensus trees of all bootstrapped sequences were obtained 
with the program CONSENSE. Representations of the calcu- 
lated trees were constructed with Tree View [74]. Clusters with 
bootstrap values greater than 50% were defined as confirmed 
subgroups, and sequences with lower values added to these 
subgroups according to their sequence similarity in the align- 
ment as judged by visual inspection and the results of pairwise 
local alignments with all other sequences by BLASTP [25]. The 
p-value was used as a measure of similarity. 

Quartet-puzzling trees 

Multiple protein sequence alignments of these new subgroups 
were created as described above. Phylogenetic trees were 
inferred from these alignments using Puzzle 5.0 [75] to calcu- 
late maximum-likelihood distances corrected by the JTT sub- 
stitution-frequency matrix [73] with amino-acid usage 
estimated from the data, site-to-site rate variation modeled 
on a gamma distribution with eight rate categories plus 
invariant sites, and the shape parameter estimated from the 
data. The human GPRC5B receptor of family B was used as an 
outgroup for family A. The human 5H1A receptor of family A 
was used as an outgroup for families B and C (the outgroups 
are not shown in the figures here). Quartet-puzzling (QP) 
trees were constructed with the described settings and 10,000 
puzzling steps to obtain support values (QP reliability) for 
each internal branch. The program Puzzle 5 0 was used in a 
parallelized version (ppuzzle) with a message-passing inter- 
face (MPi) implementation on a HP 9000 N-Class Enterprise 
Server Cluster consisting of five HP 9000 N-Class shared- 
memory multiprocessor systems with eight PA-RISC 8600 
(552 MHz) processors each. Representations of the quartet- 
puzzling trees were constructed with Tree View [74]. 

Additional data files 

Additional data files available with the online verson of this 
paper include a data table with names, synonyms and acces- 
sion numbers of all GPCRs, and the BLASTP results of all 
GPCRs (full-length sequences and sequences without amino 
or carboxyl termini). 
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Tyramine, p-phenylethylamine, tryptamine, and octopamine are 
biogenic amines present in trace levels in mammalian nervous 
systems. Although some "trace amines" have clearly defined roles 
as neurotransmitters in invertebrates, the extent to which they 
function as true neurotransmitters in vertebrates has remained 
speculative. Using a degenerate PCR approach, we have identified 
15 G proteln<oupled receptors (GPCR) from human and rodent 
tissues. Together with the orphan receptor PNR, these receptors 
form a subfamily of rhodopsin GPCRs distinct from, but related to 
the classical biogenic amine receptors. We have demonstrated that 
two of these receptors bind and/or are activated by trace amines. 
The cloning of mammalian GPCRs for trace amines supports a role 
for trace amines as neurotransmitters in vertebrates. Three of the 
four human receptors from this family are present in the amygdala, 
possibly linking trace amine receptors to affective disorders. The 
identification of this family of receptors should rekindle the inves- 
tigation of the roles of trace amines in mammalian nervous systems 
and may potentially lead to the development of novel therapeutics 
for a variety of indications. 

Norepinephrine (NE), dopamine (DA), and serotonin (5- 
HT) are classical biogenic amine neurotransmitters whose 
well characterized effects are mediated by interactions with 
subfamilies of receptors that belong to the rhodopsin superfam- 
ily of G protein-coupled receptors (GPCRs). In addition to these 
classical amines, there exists a class of "trace amines" that are 
found in very low levels in mammalian tissues, and include 
tyramine, ^phenylethylamine (/3-PEA), tryptamine, and octo- 
pamine (1). The rapid turnover of trace amines, as evidenced by 
their dramatic increases following treatment with monoamine 
oxidase (MAO) inhibitors or deletion of the MAO genes, 
suggests that the levels of trace amines at neuronal synapses may 
be considerably higher than predicted by steady-state measures 
(2-5). The role of trace amines as neurotransmitters in inverte- 
brates is well established and octopamine is thought to be the 
sympathetic nervous system counterpart to NE (6-9). GPCRs 
for tyramine and octopamine have been cloned from both insects 
(10-14) and moUusks (15, 16). 

Although there b clinical literature that supports a role for trace 
amines in depression as well as other psychiatric disorders and 
migraine (2, 3, 17-20), the role of trace amines as neurotransmitters 
in mammalian systems has not been thoroughly examined- Because 
they share common structures with the classical amines and can 
displace other amines from their storage vesicles, trace amines have 
been referred to as 'false transmitters" (21). Thus, many of the 
effects of trace amines are indirect and are caused by the release of 
endogenous classical amines. However, there is a growing body 
of evidence suggesting that trace amines function independently of 
classical amine transmitters and mediate some of their effects via 
specific receptors (for review, see refe. 22-24). Saturable, high- 
affinity binding sites for pH]tryptamine (23, 25-27), />-pH]tyra- 
mine (28-30), and 0-[ 3 H]PEA (31 ) have been reported in rat brain, 
and both the pharmacology and localization of these sites suggest 
that they are distinct from the amine transporters. However, 
although binding sites in brain and other tissues have been reported, 



no specific receptors for these trace amines have yet been identified 
conclusively. 

We now report the identification of a family of related mamma- 
lian GPCRs of which two members have been shown to specifically 
bind and/or be activated by trace amines. TA t is activated most 
potently by tyramine and 0-PEA, and TA 2 is activated most 
potently by 0-PEA. The 15 distinct receptors described here, along 
with the orphan receptor PNR (32) and the pseudogenes GPR58, 
GPR57 (33), and the 5-HT.j pseudogene (34), share a high degree 
of sequence homology and together form a subfamily of rhodopsin 
GPCRs distinct from but related to 5-HT, DA, and NE receptors. 
We further describe the localization of TAj in human and rodent 
tissues, as well as the chromosomal localization of the human 
members of this family. The identification of this family of receptors 
should facilitate the understanding of the roles of trace amines in 
the mammalian nervous system. 

Materials and Methods 

Degenerate PCR. To clone a rat TAj fragment, PCR was per- 
formed on genomic DNA by using primers designed based on an 
alignment of the sixth (5'-TNNKNTGYTGGYTNCCNT- 
TYTTY-3') and seventh (5'-ARNSWRTTNVNRTANCC- 
NARCC-3') transmembrane (TM) domains of a subset of 5-HT 
receptors. To clone rat TA», human TA 5 , rat TA 7 , rat TA^ and 
rat TAo, PCR was performed on genomic DNA by using primers 
designed based on an alignment of the first intracellular loop and 
TMII (5 '-TT Y A ARCAR YTNC AYWSNCCNAC-3 ' ) or the 
first extracellular loop (5'-GARHVNTGYTGGTAYTTYGG- 
3') and TMVI (5'-ATNCCNARNGTYTTNRCNGCYTT-3' or 
5 ' -CCA RC ANRNN ARR A AN ACNCC-3 ' ) of TA,, GPR58, 
and GPR57. To clone TA 2 , PCR was performed on rat genomic 
DNA by using primers designed based on an alignment of the 
first intracellular loop TMII (5 # -TTYAARSMNYTNCAY- 
WSNCCNAC-3') and the first extracellular loop (5'- 
CCRAARWACCARCANBNYTCNRY-3) of TA,, TAi, 
GPR58, PNR, and the 5-HT« pseudogene. For the cloning of a 
rat TAj fragment, PCR was performed on genomic DNA by 
using primers designed based on an alignment of TMVI (5'- 
GYNTWYRYNNTNWSNTGGHTNCC-3') and TMVII (5'« 
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AVNADNGBRWAVANNANNGGRTT-3') of a collection of 
rhodopsin GPCRs. PCR conditions were: 94°C for 5 min; 10 
cycles of 94°C for 30 s, 44°C for 45 s or 43°C for 1 min (TAi), 72°C 
for 1 min 45 s; 30 cycles of 94°C for 30 s, 49°C for 45 s or 48°C 
for 1 min (TA,), 72°C for 1 min 45 s; 72°C for 20 min. PCR 
products were subcloncd into the TA cloning vector (Invitro- 
gen), sequenced (Big Dye cycle sequencing protocol and ABI 
377 sequencers from Applied Biosystcms), and analyzed 
(Wisc onsin Package, Genetics Computer Group. Madison. Wl). 

Library Screening, Rat liver or human placenta! genomic phage 
libraries (Stratagene) or a rat cosmid library (CLONTECH) 
were screened by using 32 P-labeled oligonucleotide probes and 
standard protocols. Positive signals were isolated and hybridizing 
bands, identified by Southern blot analysis, were subcloned into 
pcDNA3.1 (Invitrogen) or a modified form of pcEXV (35) and 
sequenced as above. 

Low Stringency PCR. Fragments of species homologues of TAj 
were amplified from genomic DNA using primers designed 
against the rat TA,. PCR was performed with the Expand Long 
Template PCR System (Roche Molecular Biochemicals) with an 
annealing temperature of 45-5 1°C. 

Rapid Amplification of cDNA Ends (RACE). 5' and 3' RACE were 
performed according to the manufacturer's protocol, using 
Marathon-Ready cDNA (CLONTECH) from kidney and stom- 
ach (human TAi), kidney and testes (rat TA 2 ), spinal cord (rat 
TA 3 ), and brain (mouse TA,). Coding regions were amplified 
multiple times from genomic DNA, human amygdala cDNA, or 
rat testes cDNA by using primers specific to the 5' and 3' 
untranslated regions. 

Oocyte Injection and Recording. Oocytes were isolated from Xenopus 
bevis (Xenopus 1, Ann Arbor, MI) and maintained, injected, 
incubated, and recorded from as described (36). Oocytes were 
injected with 10-15 ng of mRNA encoding TAi with or without 10 
ng of mRNA encoding the cystic fibrosis transmembrane conduc- 
tance regulator (CFTR; ref. 37). Ligands were applied by local 
perfusion from a 10-/il glass capillary tube 0.5 mm from the oocyte. 

Measurement of Intracellular cAMP. Transiently transfected COS-7 
cells were incubated in Dulbecco's PBS supplemented with 10 
mM Hepes, 10 mM glucose, 5 mM theophylline, and 10 $M 
pargyline for 20 min at 37°C in 95% Oj/5% CO* Test com- 
pounds were added and cells were incubated for 10 min. The 
medium was aspirated and the reaction stopped by the addition 
of 200 |il of 100-mM HO. The cAMP content in each well was 
measured by RIA (Scintillation Proximity Assay, Amersham 
Pharmacia Biotech) using a microbeta Trilux counter (Wallac, 
Gaithersburg, MD). 

Radioligand Binding. Membranes prepared from cells transiently 
transfected with human TAi and rat Go, were diluted in 25 mM 
Gly-Gly buffer (Sigma, pH 7.4 at 0°Q containing 5 mM ascor- 
bate (final protein concentration « 120 pg/ml). Membranes 
were then incubated with [ 3 H]tyramine [American Radiochemi- 
cals, St. Louis; specific activity 60 mCi/jiM (1 Ci « 37 GBq)] in 
the presence or absence of competing ligands on ice for 30 min 
in a volume of 250 $d. Bound ligand was separated from free 
ligand by filtration through GF/B filters presoaked in 05% 
polyethyleneimine, using a Brandel (Bethesda, MD) cell har- 
vester vacuum filtration device, and bound radioactivity quan- 
tified by using a scintillation counter. Data were fit to nonlinear 
curves by using prism (GraphPad, San Diego). 

Quantitative Reverse Transcription (RD-PCR. cDNA was prepared 
from DNase-treated total RNA purchased from CLONTECH or 



isolated from human tissues by using TRIzol reagent (Lite 
Technologies, Grand Island, NY). Integrity of RNA and cDNA 
was assessed by amplification of cyclophilin or glyceraldehyde- 
3-phosphate dehydrogenase (GAPDH). PCR reactions were 
carried out in a PE7700 sequence detection system (Perkin- 
Elmer) according to the manufacture's protocol. The probe 
|5'(6-FAM)-ATCiGTGAGATCTGCTGACCACTGTTCG- 
TATT-(TAMRA)3'] was labeled with FAM (6-carboxyfluorcs- 
cein) as the reporter and TA MR A (6-carhoxy-4 ? 7,2.7'- 
telramethvlrhodamine) as a quencher, and the forward and 
reverse PCR primers were 5'-CATGGCCACTGTGGACTT- 
TCT-3' and 5 ' -GTCGGTGCTTGTGTG A ATTTTAC A-3 ' . re- 
spectively. The fluorescent signal from each well was normalized 
by using an internal passive reference, and data were fitted to a 
standard curve generated with genomic DNA. 

Chromosomal Localization. The Stanford Human Genome Center 
(SHGC) G3 panel of 83 radiation hybrids was analyzed by PCR 
using 20 ng of DNA and the same primers, probes, and thermal 
cycler profiles as used for localization. The RH Server (at 
SHGC; www-SHGCstanford.edu) and the National Center for 
Biotechnology Information's LocusLink and GeneMap '99 were 
used for analysis. 

In Shu Hybridization Histochemistry. Sense and antisense ribo- 
probes (251 bp, TMV-TMVI of mouse TAj) were labeled with 
digoxigenin as outlined in the DIG/Genius System (Roche 
Molecular Biochemicals). Male 129S6/SVEV mice (20 g, Tac- 
onic Farms) were anesthetized with ket amine 20 mg/kg (Re- 
search Biochemicals) and xylazine 0.2 mg/kg (Sigma), and 
perfused transcardially with PBS followed by 4% paraformal- 
dehyde/PBS. Tissues were cryoprotected, stored at -20°C, and 
sectioned (30 /iM) by using a freezing microtome. Free-floating 
sections were incubated in 100 mM glycine for 5 min and 0.3% 
Triton X-100 for 15 min, then rinsed twice in PBS for 5 min. In 
situ hybridization histochemistry was carried out on free-floating 
tissue sections as outlined in the DIG/Genius System with a 
hybridization temperature of 52 6 C in a buffer containing 40% 
formamide. 

Results 

In an attempt to identify additional 5-HTi-like receptors, such as 
the elusive 5-HTlp receptor (38), degenerate PCR primers were 
designed against TMs VI and VII of an alignment of 5-HT, 
receptors and used to amplify rat genomic DNA at reduced 
stringency. One product from this reaction was found to be a 
DNA sequence, not found in the GenBank database, with 
42-48% amino acid identity to 5-HT4, DA D 2 , and ^adrenergic 
receptors. Sequencing of the corresponding full-length cDNA, 
BOl 11, revealed an ORF of 996 bp that is predicted to encode 
a protein of 332 aa (Fig. 1, rat TA,). An allelic variant of this 
receptor was also identified wherein a glutamine replaces a 
leucine at position 170. BOlll is most closely related to GPR58 
(50% aa identity), the human 5-HT 4 pseudogene (47% aa 
identity, with frame shifts -corrected"), BO107 (an orphan 
GPCR previously identified at Synaptic and later renamed TA 3 ) 
and GPR57 (45% aa identities), PNR (38% aa identity), and 
5-HTid, 5-HT4, and 5-HT 7 receptors (35-37% aa identities). 
Human and mouse orthologues of BOlll were obtained by 
standard methods. The amino acid sequences of the human and 
mouse receptors are 76% identical to each other and 79% and 
87% identical to the rat receptor, respectively (Fig. 1). 

A search for the endogenous ligand for the receptor encoded 
by BOl 1 1 was performed by expressing it in oocytes alo ng with 
mRNA encoding the cA MP- responsive CI channel, CFTR. 
Candidate ligands were tested in eleven groups of five. From this 
broad panel, octopamine and, more weakly, DA and 5-HT, 
elicited inward currents at 100 i*M (Fig. 24). Stimulation by 
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Fig. 1. Alignment of rat mouse, and human TA, and ratTA, receptors (GenBank accession nos. AF380186. AF380187, AF380185. and AF380188, 

Shaded residues are conserved in all four receptors. Triangles and circles indicate residues conserved in TArTA*. Open tnangles are a so conserved among all 

^Zl,k receptors, and open circles are conserved among all human 5-HT but not NE or OA receptors. Seven putat.ve TM doma.ns are ,nd,cated. 



octopamine (100 /iM) produced an average current amplitude of 
230 ± 55 nA (n = 4). Similar currents were generated by 
tyramine at a lower concentration (100 nM; 287 ± 31 nA, n = 
28; Fig. IB). EC$o values were obtained for octopamine (635 ± 
151 nM) and tyramine (37 ± 4.4 nM) from cumulative concen- 
tration effect responses (data not shown). These results sug- 
gested that BOlll encodes a receptor for trace amines, and was 
thus named TAi. No such currents were observed in oocytes 
injected with only mRNA encoding the CFTR channel. Oocytes 
expressing rat TAi without CFTR failed to generate inward 




Fig. 2. Voltage-damp responses to trace amines In oocytes. (A) Respo nse to 100 
(iM octopamine (Oct) or 5-HT in an oocyte expressing rat TAi and CFTR. (B) 
Response to 100 nM tyramine fTyr) in an oocyte expressing rat TA, and CFTR. (Q 
Response to 100 jiM octopamine in an oocyte expressing only TA,. (D) Response 
to 100 nM tyramine (Tyr) in an oocyte expressing human TA, and CFTR. Holding 
potential was -80 mV for an oocytes. Marker bar in O also applies to A and C. 



currents (Fig. 2C; n = 1 1), suggesting that stimulation of rat TAi 
by octopamine and tyramine resulted in the generation of cAMP 
leading to CFTR channel opening, presumably via activation of 
the endogenous Xenopus G protein Goc s . Oocytes expressing the 
human orthologue of rat TAi with CFTR also produced inward 
currents in response to application of 100 nM tyramine (Fig. 2D). 

Additional bioamines were tested for activity at human TAi 
expressed in mammalian cells. Human TAi was activated most 
potently by 0-PEA and tyramine, and more weakly by octopam- 
ine and DA (Table 1 and Fig. 3). The agonists listed in Table 1 
produced an increase in intracellular cAMP accumulation, likely 
via the Go,-class of G proteins in COS-7 cells transfected with 
human TAi, but not in mock-transfccted cells. 

Consistent with the relatively high potency of tyramine for 
activating human TAi, pH]tyramine demonstrated high-affinity, 
saturable binding m TAi-cxpressing membranes (average = 
20 nM; data not shown). Selectivity of human TAi for 0-PEA 



Table 1. Pharmacological profile of human TAi 




Compound 


JC b nM 


EC»nM 


0-PEA 


8.0 + 3.2 


324* 110 


Tyramine 


34 ± 11 


214 t 67 


Dopamine 


422 ±11 


6,700 ± 1,700 


Octopamine 


493 ± 99 


4,029 * 75 


Tryptamine 


1,084 ± 159 


>6 uM 


Histamine 


3,107 * 1,593 


>5uM 


Serotonin 


>6 uM 


>10uM 


Norepinephrine 


>10uM 


>5uM 



Values represent the average ± SEM from n * 3 experiments. Binding Kfl 
were determined from displacement of [ s H)tyramine (20 nM) and EC» values 
were determined by Increases in cAMP accumulation. 
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Fig. 3. 0-PEA-induced responses in COS-7 cells Uansferted with human TA,, 
rat TA*. or vector. Cells were incubated with increasing concentrations of 
0-PEA and cAMP accumulation measured. Data are from duplicate determi- 
nations and are representative of three to six experiments. 

and tyramine is shown in Tabic l.The rank order of potency was 
similar between the binding and functional assays, although the 
K\ values determined from binding displacement were ~6-fold 
lower than EQn values determined in functional studies. This 
difference has been reported for many exogenousty expressed 
receptors, including the 5-HT 7 receptor (39), and may be due to 
the relatively low expression levels of TAj in COS-7 cells, weak 
coupling of the receptor to signaling components in these cells, 
or differences in assay conditions. 

Additional degenerate PCR work performed on rat genomic 
DNA led to the identification of TA 2 . TA 2 is most similar to the 
human 5-HT 4 pseudogene (82% aa identity with frame shifts 
"corrected"), and shares 48-51% aa identity to the rat and 
human TA, receptors (Fig. 1), GPR57 and GPR58. The expres- 
sion of rat TA 2 in COS-7 cells resulted in an increase in cAMP 
accumulation, presumably via Ga s -class G protetn(s). Of the 
biogenic amines tested, only 0-PEA and tryptamine activated 
this receptor, however, the response was of low potency [EC50 = 
1.9 ± 0.5 tiM (Fig. 3) and 17 ± 2 u-M (data not shown) 
respectively]. The low potency of trace amines for rat TA 2 in 
heterologous expression systems may be explained by its poor 
surface expression, as determined by subcellular localization of 
an epitope-tagged rat TA 2 (data not shown). Alternatively, other 
more potent agonists may exist for rat TA* Because the human 
orthologue of this receptor is most likely the 5-HT 4 pseudogene, 
no further studies were conducted on rat TA 2 . 

Further degenerate PCR work led to the identification of TAs. 
from human genomic DNA, and TA*, TA 7 , TA& and TA* from 
rat genomic DNA. While isolating these full-length receptors 
from genomic libraries, several additional closely related recep- 
tors were also isolated, including human TA4 and rat TA«, TA jr* 
TA,„ TAi* TAij. TA,4, and TA15. TA4-TA13 are highly homol- 
ogous to each other, with overall aa identities of 62-96% (see 
Fig. 6, which is published as supplemental data on the PNAS web 
site, www.pnas.org). These receptors are 66-73% identical to 
TAj, 41-48% identical to TA* 40-44% identical to TAi, and 
28-36% identical to 5-HT receptors. As indicated in Fig. 1, there 
are 74 residues that are completely conserved in TA1-TA15. Of 
these, 52 are uniquely conserved in the trace amine family 
(closed triangles), 18 (of 25) are also conserved in all human 
monoaminergic receptors (open triangles), and an additional 14 
(of 32) are conserved with human 5-HT receptors, but not NE 
or DA receptors (open circles). 

A phytogenetic tree was constructed from the aa sequences of 
TA|-TA I5 , PNR, GPR57, GPR58, the 5-HT 4 pseudogene, and 
several vertebrate and invertebrate aminergic receptors (Fig. 4). 
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Fig. 4. A phylogenetic tree for trace amine receptors TAi-TA 15 , human 5-HT 
receptors 5-HT 1a . 5-HT !b . 5-HTid. 5-HT*. 5-HT*. 5-HT 7 , human ala receptor 
(AR-aiJ, GPR57, GPR58. PNR, 5-HT 4 psuedogene (5-HTu). drosophila (Dros- 
oph) receptors for octopamine (Oct), 5-HT, and tyramine (Tyr), Caenorhabditis 
elegans (C Elegans) 5-HT receptor, tyramine receptors from bee (Bee Tyr) and 
locust (Locust Tyr), and a snail octopamine receptor (Snail Oct). Amino acid 
sequences for each receptor spanning from the start of TMI to the end of 
TMVll were aligned by using the cuistaiw algorithm and the tree constructed 
by the NJ method on a Oecyphertl Bloaccelerator (Time log k, Reno. NV). 
GenBanfc accession nos.: AF380190 (rat TAj), AF 380 189 (human TAj). 
AF380191 (rat TA4), AF380192 (human TA4}, AF3801 93-AF380203 (TAf-TAis). 

The TA receptors, along with GPR57, GPR58, 5-HT 4 pseudogene, 
and PNR, branch separately from mammalian receptors for clas- 
sical biogenic amines, including those for 5-HT, and from the 
invertebrate trace amine receptors. Within this large family of 
receptors, there appeals to be at least two subfamilies. TAi and 
TA2, along with GPR57, GPR58, and 5-HT 4 pseudogene, consti- 
tute one subfamily, and TAa-TAis constitute a second subfamily. 

Radiation hybrid mapping using primers selective for TAi, 
TA* TAa, and TA5 was used to identify the chromosomal 
localization of the human trace amine receptors. All four genes 
had virtually identical patterns and mapped to SHGC-1836. This 
placed the TA receptor genes in the region of chromosome 
6q23.2. Interestingly, PNR, GPR57, GPR58, and 5-HT 4 pseu- 
dogene were previously shown to be clustered between 6q22 and 
6q24 by using fluorescence in situ hybridization analysis (32, 33). 

Human TAj mRNA was detected by quantitative reverse 
transcription (RT>PCR in low levels in discrete regions within 
the central nervous system (CNS) and in several peripheral 
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Fig. 5. Photomicrographs showing hybridization signals for TAi mRNA in 
mouse CNS. Signal detected in cerebellar Purkinje cells (arrows) hybridized 
with antisense (A and 8) and sense (O probes. Scale bar in C <200 tixn) also 
applies to A and £; scale bar in B, 25 /im. Photomicrographs showing hybrid- 
ization signal in cells (arrows) in the dorsal raphe (O). ventral tegmental area 
(VTA), substantia nigra, compact part (SNc) and reticular part (SNr) (£), and 
locus coeruleus (LC 0- Scale bar in F (50 fim) also applies to 0. MPB, medial 
parabrachial nucleus; Me5, mesencephalic trigeminal nucleus; m. medial lon- 
gitudinal fasciculus; scp, superior cerebral peduncle. 

tissues. Moderate levels (100 copies/ng cDNA) were expressed 
in stomach, low levels (15-100) expressed in amygdala, kidney, 
lung, and small intestine, whereas trace amounts (<15) were 
expressed in cerebellum, dorsal root ganglia, hippocampus, 
hypothalamus, liver, medulla, pancreas, pituitary, pontine retic- 
ular formation, prostate, skeletal muscle, and spleen. Message 
levels for the other human trace amine receptors were also 
detected in low levels. TA., mRNA was detected only in kidney. 
TA4 and TA 5 mRNA were expressed in kidney and amygdala, 
and TA4 was also detected in the hippocampus. 

A widespread and unique distribution of TAi mRNA was 
revealed in the mouse CNS by in situ hybridization histochem- 
istry (Fig. 5) and the hybridization signal was localized to the 
cytoplasm of neuronal profiles (Fig. 5 A and £). Several brain 
regions exhibited intense labeling specifically, the mitral cell 
layer of the olfactory bulb, piriform cortex, the arcuate, motor, 
and mesencephalic trigeminal nuclei, lateral reticular and hypo- 
glossal nuclei, cerebellar Purkinje cells, and ventral horn of the 
spinal cord Moderate labeling was evident in the frontal, 
entorhmal, and agranular cortices, the ventral pallidum, thala- 
mus, hippocampus, several hypothalamic nuclei, ambiguus, dor- 
sal raphe, and gigantocellular reticular nuclei. Weaker staining 
was visible in the septum, basal ganglia, amygdala, myelenceph- 
alon, and spinal cord dorsal horn. Particularly interesting was the 
moderate expression of TAi mRNA in several monoaminergjc 
cell groups, namely the dorsal raphe (Fig. 50), the locus 
coeruleus (Fig. 5F), and the ventral tegmental area (Fig. 5£). 

Discussion 

We have identified a multigene family of intronless GPCRs and 
have demonstrated that TAj is potently activated by tyramine 
and 0-PEA and displays low affinity for tryptamine, octopamine, 
and DA. An additional member of this family, TA2, is also 



activated by 0-PEA and tryptamine. Although the roles of 
tyramine and octopamine as neurotransmitters acting via stim- 
ulation of G protein-coupled receptors in invertebrate systems 
are well established (6-9), mammalian GPCRs for trace amines 
have not, to our knowledge, been previously reported. The 
present finding lends strong support to a role for trace amines as 
neurotransmitters or neuromodulators in vertebrates. 

One of the most interesting and unexpected findings of this 
study was the discovery of such a large family of highly related 
receptors. In addition to TAi and TA : . we have identified 13 
other related receptors. These additional receptors, TAy-TAn, 
share an unusually high degree of amino acid identity (62-96%). 
For comparison, the human 5-HT receptors share 28-63% 
amino acid identities. The high degree of homology within 
members of the TA family and the tight clustering of the human 
TA receptors on chromosome 6q23.2 suggests that these recep- 
tors evolved relatively recently, after the invertebrate/vertebrate 
split, and makes it tempting to speculate that this region may 
represent a hotspot for gene duplication events. 

Although the degree of homology between receptors within a 
species is extremely high, the degree of amino acid identity 
among orthologues is moderate to low. The rat and human 
orthologues of TA 3 and TAj share a moderate degree of amino 
acid identities (87% and 88%, respectively). However, the 
mouse, rat, and human orthologues of TAi share a relatively low 
degree of homology (87% for rat and mouse, 79% for rat and 
human, 76% mouse and human). This observation suggests that 
although these receptors are relatively recent expansions of the 
genome, they are evolving at a rapid rate. 

Another interesting observation is that a larger number of rat 
receptors has been identified so far as compared with human 
receptors. Four human receptors have been identified (TAi, 
TA 3 , TA4, and TA 5 ), whereas 14 rat receptors have been 
identified (TA1-TA4 and TA^-TA^). To date, only a human 
form of PNR has been reported (29). There are also a large 
number of pseudogenes within the human members of this 
family. The 5-HT 4 pseudogene (31), which we propose should be 
renamed ^TA:, and <t<j?R51 (33) each contain frame shifts 
resulting in premature stop codons, whereas <J<jPR58 lacks an 
amino terminus (33). Although additional human receptors may 
ultimately be identified, the striking difference in the number of 
rat and human receptors suggests that this family may play very 
different roles in different species. 

We have demonstrated a functional response to heterologousry 
expressed TAi in both Xenopus oocytes and a mammalian cell 
system The response in both assays indicates that TAi couples to 
the stimulation of adenylate cyclase through a Go, G protein. The 
human TAi receptor is activated by tyramine and 0-PEA, less 
potently by octopamine, and binds 0-PEA and tyramine with high 
affinity and tryptamine, octopamine, and DA with lower affinity. 
The rat TA2 receptor is activated by 0-PEA and tryptamine, also via 
stim ulation of a Go, G protein. Thus far, we have not demonstrated 
functional responses to tyramine, 0-PEA, tryptamine, octopamine, 
or the classical biogenic amines in COS-7 cells expressing TA3- 
TA15. This finding may be due to poor trafficking to the plasma 
membrane (data not shown), or these receptors may respond to 
related, perhaps as yet unidentified, amines. However, the high 
degree of sequence conservation between the two subfamilies, the 
evolutionary branching analysis, as well as the chromosomal prox- 
imity of the receptors make it very likely that TAy-TAi5 encode 
receptors for trace amines. 

Human TA! mRNA is expressed in low to moderate levels in 
peripheral tissues such as stomach, kidney, and lung, and within the 
CNS appears to be restricted primarily to the amygdala. The 
expression of TAj mRNA is lower than that seen for receptors of 
classical neurotransmitters, consistent with the low levels of trace 
amines relative to other neurotransmitters. All of the human 
members of the TA family are expressed in the kidney, supporting 



S970 I www4KW$.Ofg/cgl/doi/10.107J/pn«$.151105t98 



Borowsfcy ttsl. 



a role in blood pressure regulation and electrolyte homeostasis. 
This may be related to the "cheese effect," wherein dietary-induced 
elevations in tyramine levels in patients taking MAO inhibitors 
results in hypertension and migraine (see ref. 19 for review). 

The expression of TAi mRNA in human amygdala is intrigu- 
ing in light of evidence suggesting a role of trace amines in the 
etiology and /or treatment of depression and anxiety disorders. 
A functional deficiency of 0-PEA and tryptaminc h;is been 
proposed as a potential etiological factor in depression (40-42), 
and increased levels of 0-PEA are associated with the manic- 
phase of bipolar disease (43). Antidepressants that inhibit MAO 
produce proportionally greater increases in trace amines than 
5-HT (2, 3). MAO-B knockout mice have increased levels of 
0-PEA (7-fold higher), but normal levels of 5-HT, NE. and DA 
(5). Interestingly, MAO-B knockout mice show a reduced de- 
crease in mobility in the forced swim test, similar to that induced 
by antidepressants (5). Taken together, these results suggest that 
TAi receptors in the amygdala may be an important site of action 
for trace amines, particularly 0-PEA, in the etiology and treat- 
ment of depression. The expression of mouse TAj mRNA in the 
dorsal raphe, locus ceruleus, and ventral tegmental area indi- 
cates that trace amines may modulate the activity of 5-HT, NE, 
and DA systems and further supports a role for trace amine 
receptors in the regulation of mood. 

Human trace amine receptor genes map to chromosome 
6q23.2, close to SCZD5, a susceptibility locus for schizophrenia 
(6ql3-26 with the greatest allele sharing at 6q21-22.3; ref. 44). 
Because of structural and physiological similarities, /3-PEA has 
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been described as the body's endogenous amphetamine (45, 46). 
Amphetamine produces a paranoid schizophrenic syndrome in 
humans, and chronic treatment with either amphetamine or 
J3-PEA produces a behavioral sensitization in animals (47-51). 
Moreover, numerous clinical studies have demonstrated ele- 
vated urinary levels of p-PEA in schizophrenic patients (52. 53). 
Thus, it will be important to delineate the role of TA receptors 
in the etiology and treatment of schizophrenic. 

Although trace amines have long been thought to be neurotrans- 
mitters, the understanding of their physiology has lagged that of the 
classical biogenic amines, in part, beat use the receptor targets 
remained elusive. The identification of mammalian GPCRs for 
trace amines supports a role for trace amines as bona fide neuro- 
transmitters in vertebrates. The localization of mRNA for three of 
the four human receptors in amygdala lends a potential site of 
action for the postulated role of trace amines in the etiology and/or 
treatment of several affective disorders. Future characterization of 
TA3-TA15 will further enhance our understanding of these recep- 
tors. The discovery of this family of receptors provides a means to 
evaluate the physiological roles of trace amines in higher species and 
their regulation in diseased processes, and to explore potential 
therapeutic applications associated with these receptors. 
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Abstract 



We reoort the identification, cloning and tissue distributions of ten novel human genes encoding G protem-^pled receptors (GPCfe) 
G^GPR^O SwSK. GPR93, GPR94, GPR95, GPR1QL GPR102, GPR103 and a pseudogene, *G/*79. Each nov el orphan 
£ra S u^g customized searches of the GenBank high-throughput genomic sequences database with 

SSLS^ The expressed genes can now be used in assays to detern^e endogenous and pharmac.1^ 

SZITgPR78 shared highLtVdentity with the oGPCR gene GPR26 (56% identity in the transmembrane (T^re^ons). +WR79 
f^^Ll identity^* the B* gene and contained a frame-shift treating 

pseudogene GPR80 shared highest identity with the P2Y, gene (45% in the TM regions), while GPR81, GPR82mi ^ » ™ 
fStir^^^GPCR fJ*HM74 (70%), GPR17 (30%) and P2Y 5 (40%), respectively. Two other novel GPCR g enes^/?$M and 

GPR101 demonstrated only distant identities with other GPCR genes and GPR102 shared identities withGJPR57, i * i^L^ 

fn ^e ™Tgio^). GPR103 shared identities with the neuropeptide FF 2, neuropeptide Y2 and gdarun GalRl 

^ns)NoLm 

^Sn^atabaLs with the GPR82 sequence retrieved an identical sequence m an expressed sequence tag <^^J^^ 
GP/J«2ftornhumancolomc tissue.^ 

an EST wrtially encoding mouse GPR93 from small intestinal tissue. GPR94 was expressed in the frontal cortex, r^tamen and 

5^^^^^ expressed in the human prostate and rat stomach and fetal tissues. GPR101 -ealed n^A ****** 
T^pZm and hypotr^amTSiOi raRNA signals were detected in the cortex, pituitary, thalamus, hypothalamus, basal 
forebrain, midbrain and pons. © 2001 Elsevier Science B.V. All rights reserved. 
Keywordr. Orphan G protein-coupled receptor; Transmembrane; Intrtmless; Pseudogene; Chromosome 



1. Introduction 

As is frequently stated, GPCRs are the largest family of 



Abbreviations: aa, amino acid; BAC, bacterial artificial chromosome; 
EST. expressed sequence tag; GPCRs, G protein-coupled receptors; 
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orphan G protein-coupled receptors; ORF, open reading frame; TM, trans- 
membrane 
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cell surface receptors and are responsible for the signal 
transduction for a diverse variety of ligands including 
nucleotides, biogenic amines, peptides and other small 
molecules (Marchese et al., 1999). GPCRs share a common 
heptahetical topography and these regions are embedded in 
the membrane. These seven transmembrane (TM) regions 
share the most significant levels of receptor identity. As a 
consequence, the majority of DNA sequences encoding 
GPCRs were found using methods dependent on sequence 
homology, mainly PCR or electronic sequence database 
screening (Marchese etal., 1998). GPCRs activated by simi- 
lar ligands share the greatest identities with each other. 
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However, newly discovered GPCRs frequently have oriy 
distant identities with known GPCRs, and these oGPCRs 
are difficult to characterize given the diversity m structure 
of the ligands and effector systems. This problem is 
compounded as we now realize that many endogenous 
ligands remain to be discovered. Increasingly, oGPCR char- 
acterization has utilized methods of 'reverse pharmacol- 
ogy', using the receptor as bait to retrieve ligands from 
tissue extracts. These efforts have identified endogenous 
ligands such as apelin, ghrelin, melanin-concenttating 
hormone, neuromedin U, the orexins. urotensm-H, and 
UDP-glucose (recently reviewed in Lee et al., 2001a; CiveUi 
et al., 2001; Howard et al., 2001). 

Approximately 250 mammalian genes encoding family 
A (or rhodopsin-like) GPCRs have been cloned (Lee et al.. 
2001a). As yet, the total number of cloned GPCRs reported 
in the literature including the secretin and metabotropic 
glutamate-like families of GPCRs falls short of the 
projected 616 GPCR-encoding sequences observed from 
the complete human genome sequence (Venter et al., 
2001). Despite the human genome sequencing efforts, 
much work is still required to identify and clone the 
open reading frames (ORF) encoding the full complement 
of GPCR genes. Inserted into suitable expression vectors, 
these DNA sequences can be used to express the receptor 
in assays which will assist in ligand identification. For 
these reasons, we are continuing in our efforts to identify, 
catalog, compare and map the expression of GPCRs. We 
have recently reported the identification of the H4 hista- 
mine receptor (Nguyen et al., 2001), the cysternyl leuko- 
triene 2 receptor (Heise et al.. 2000) and the oGPCR- 
encoding genes GPR26. GPR57, GPR58 (Lee et al., 
2000), GPR6J, GPR62, GPR63 and GPR77 (Lee et al., 
2001b). We now report the cloning of ten additional 
oGPCR-encoding genes named GPR78, GPR ^r^^ 81t 
GPR82, GPR93, GPR94, GPR95, GPR101, (GPRW ) and 
GPR103 as well as a pseudogene $GPR797VPR78 and 
GPR81 most closely resemble the oGPCR genes GPR26 
and HM74, respectively. GPR80, GPR93 and *GPR79 
shared highest identities with members of the purinoceptor 
family, while GPR82 encoded an oGPCR distantly related 
to the purinoceptor-like oGPCR genes GPR17 and GPR34. 
In addition, two novel genes GPR93 and GPR94 share 
significant identities with each other and with recently 
identified genes encoding the UDP-glucose (Chambers et 
al., 2000) and platelet ADP (P2Y12) receptors (Hollopeter 
et aL, 2001 ; Zhang et al., 2001), which together comprise a 
clustered family of genes on chromosome 3. GPR101 
shared distant identity with amine-binding GPCR genes, 
GPR102 shared identity with the PNRIGPR57IGPR58 
amine receptor-like subfamily of GPCR genes and 
GPR103 shared identities with pepticte-binding receptors, 
including the neuropeptide FF 2, neuropeptide Y2 and 
galanin GalRl receptors. We have also detected mRNA 
transcripts in tissues for GPR78, GPR81, GPR94, 
GPR95, GPR101 and GPRI03. 
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2.1. Database searching 

We queried the expressed sequence tag (EST) and high- 
throughput genomic sequences (HTGS) databases main- 
tained by the National Center for Biotechnology Informa- 
tion with the amino acid (aa) sequences of various GPCRs 
using the TBLAST algorithm (Altschul et al., 1997). 
Retrieved sequences having statistically significant scores 
were further examined. The conceptualized protein 
sequences encoded by these sequences were used to query 
the non-redundant (nr) database to determine whether these 
sequences encoded previously known GPCRs. 

2.2. GPCR gene and cDNA cloning 

GPR78 was originally obtained in two fragments in an 
HTGS sequence from human chromosome 4 (GenBank 
Accession number: AC007104) which encoded the start 
methionine to the third intracellular loop (IC3) and from 
the carboxyl region of TM6 to the stop codon. Based on 
this sequence, two DNA fragments encoding from TM1 to 
TM4 and from TM6 to the stop codon were amplified from 
human genomic DNA. PCR products were extracted with 
phenol and chloroform, precipitated with ethanol and elec- 
trophoresed on a low-melting point agarose gel. Products in 
the expected size range were Ugated into the EcdSN site of 
pBluescript SK(-) (Stratagene, La JoUa, CA) or pcDNAj 
(Invitrogen, Carlsbad, CA) and sequenced. Both fragments 
were observed to be identical to the HTGS sequence, and 
were used as probes to screen a human genomic library as 
previously described (Marchese et al., 1994). Library screen- 
ing retrieved two phage DNA which encoded from the start 
methionine to IC3 and from the carboxyl region of TM6 to 
the stop codon. Primers designed upon TM5 (PI: 5 - 
GCTTCGTGCTGCCGCTG-3') andTM7 (P2: 5'-CGGAG- 
CAGAGAGTACGTG-3') were used to PCR amplify Mara- 
thon ready human fetal cDNA (Clontech, Palo Alto* CA) to 
retrieve a fragment sharing 100% identity in regions of over- 
lap with the HTGS and human genomic phage DNA 
sequences. To obtain the complete intronless ORF of this 
gene, three overlapping segments encoding GPR78 were 
obtained by PCR. Fragment 1 (encoding from the start 
methionine to TM5) was amplified from human genomic 
DNA using two primers (P3: 5'-GCGCCATGGGCCCCG- 
GCGAGG-3', P4: 5 '-GGTGACGGTGTCCATGCGC-3 ). 
Fragment 2 was amplified using primers PI and P2 from 
human fetal cDNA (described above). Fragment 3 (encoding 
from the third extracellular loop and extending 3' of the stop 
codon) was amplified from human genomic DNA using two 
primers (P5: 5'-CTGGCGGAGCTCGTGCCC-3'. P6: 5 - 
G<jCCAGTGCCCTTTCCAC-3'). These DNA fragments 
were joined by two further rounds of PCR. Round one 
consisted of fragments 1 and 2 together undergoing 30 cycles 
of 94°C for 30 s, 50°C for 30 s, and 72°C for 1 min using 
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primers P2 and P3. A second round of PCR amplified an 
aliquot of the first round with fragment 3 using primers P3 
and P6 at similar cycling conditions. The PCR products were 
subcloned into the EcoRV site of pcDNAj (Invitrogen) and 

sequenced. . 

GPR103 was originally obtained in two overlapping frag- 
ments from the EST database (encoding from TM4 to TM7, 
GenBank Accession number. AI307658) and the HTGS 
database (encoding from TM6 to the stop codon, GenBank 
Accession number. AC005961). Analysis of the EST clone 
obtained from the IM.A.G.E. Consortium revealed that this 
fragment encoded the receptor from TM2 to TM7. Based on 
these sequences, two DNA fragments encoding froraTM3 to 
TM7 and from TM7 to the stop codon were obtained by PCR 
from human hypothalamus cDNA (Clontech) and human 
genomic DNA, respectively. PCR products were purified 
and ligated into pBluescript as described above. Sequence 
analysis revealed both DNA fragments to be identical with 
their respective database sequences. The fragment encoding 
TM3 to TM7 was used as a probe to screen a human hypotha- 
lamus cDNA library (Clontech). which retrieved a phage 
encoding GPR103 from the start methionine to TM3 sharing 
100% identity in the overlapping region with the EST derived 
sequence. To obtain a complete intronless ORF of this gene, 
the three overlapping fragments were joined by PCR as 
described above. The PCR products were subcloned into 
the EccRV site of pcDNA 3 and sequenced. 

To obtain DNA encoding other GPCRs, human genomic 
DNA was amplified by PCR using the following oligonu- 
cleotides: if,GPR79 (5'-TGGGGCAGAGGCTGATGCCA- 
TGC-3'. 5 AGCTGG ATGCTC ACCC AACTTGTTC- 3 ), 
GPR80 (S'-GATTCATATTGCC AAACTGAAC-3 ', 5'-C- 
ATCCTGAACATCTTAGGATG-3'). GPR81 (S'-CTAA- 
CGCTCAGATAAGCATCTGTG-3', 5'-GTCACCACTC- 
TATCTTCCTCAGTG-3'), GPR82 (5 ' - AATTCT ATTCT- 
AGCTCCTGTG-3', S'-CTAATAAAGTCACATGAATG- 
C-3'), GPR93 (5'-TTTGGCACGATGTTAGCC-3 , 5'-G- 
TTCAG AGGGCGGAATCC-3 '), GPR94 (5'-AAGCAAT- 
GAACACCACAGTGATGC-3', S'-ATTATCTACGGAA- 
GTCTCATC-3'). GPR95 (5'-AGTTGGGTCTGTAAGG- 
GAACC-3', 5'-TTTATTTACACTTTGTACATATCG-3 ), 
GPRW1 (5'-CTGGCTGTrGCCATGACGTCC-3'. 5 -GC- 
CTT AGAACTAACTTCAAGG-3 '), GPRI02 (5'-CAAA- 
CAACAAACAGCAGAACC-3', 5'-CTTAGTGCTTAAA- 
CTTATTC-3') and P2Y a (S'-AAATAACCATCCTCTC- 
TTTTGTTC-3'. 5'-CGAGTTCTGAACACAAAGAGAT- 
TG-3'). PCR conditions were as follows: denaturation at 
94°C for 30 s, annealing at 50°C for 30 s, and extension at 
72°C for 1 min. PCR products were purified, ligated into 
pcDNA3 or pBluescript vectors and sequenced as described 
above. 

2.3. Northern blot analyses 

Human and rat raRNA were extracted from various 
tissues as described previously (Marchese et al.. 1994). 



Briefly total RNA was extracted by the method of Chomc- 
zynski and Sacchi (1987), and poly (A) + RNA was isolated 
using oligo(dT) ceUulose spin columns (Pharmacia, 
Uppsala, Sweden). RNA was denatured and separated by 
electrophoresis on a 1% formaldehyde agarose gel, trans- 
ferred onto nylon membrane and immobilized by UV irra- 
diation. The blots were hybridized with human GPCR- 
encoding 32 P-labeled DNA fragments, washed with 2 x 
SSPE (3 M NaCl, 0.2 M NaHaPO* 0.02 M EDTA) and 
0.1% SDS at 50°C for 20 min, washed again with 0.1 X 
SSPE and 0.1% SDS at 50°C for 2 h and exposed to X- 
ray film at -70°C in the presence of an intensifying screen. 
In addition, two human Multiple Tissue Northern (MTN~) 
blots (Clontech) were used in GPR78 (Human MTN Blot) 
and GPR95 (Human MTN Blot IV) expression analyses 
according to the manufacturer's instructions. 



3. Results 

3.1. Cloning of GPCR-encoding genes 

A search of the HTGS database wilh the GPR26 sequence 
retrieved a novel human genomic sequence encoding a 
GPCR localized to chromosome 4 (GenBank Accession 
number AC007104). The sequence was encoded on two 
fragments, from the start methionine to the middle region 
of IC3 and from the carboxyl region of TM6 to the stop 
codon. Primers were designed to PCR amplify human geno- 
mic DNA which retrieved fragments encoding TM1 to TM4 
and from TM6 to the stop codon. These fragments were 
used to screen a human genomic library and two phage 
DNA fragments were retrieved. We also used TM5- and 
TM7-specific primers to amplify a DNA fragment from 
human fetal cDNA. The cDNA PCR product revealed 
100% identity in regions overlapping the fragments 
previously obtained from genomic DNA, confirming these 
fragments as segments of the same gene. To obtain the full- 
length ORF, the three overlapping fragments were joined by 
PCR, and this clone was named GPR78 (Fig. 1). GPR78 
encoded a 363 aa protein which shared extensive identities 
in the TM regions with GPR26 (56%) (Table 1). 

A customized search of the HTGS database retrieved a 
human genomic sequence (GenBank Accession number 
AC021773) apparently encoding a novel GPCR. However, 
the sequence (ipGPR79) contained a frame-shift in the ORF. 
Primers were designed and used to PCR amplify this region 
of DNA sequence, and the product sequenced to verify the 
frame-shift in the TM5-encoding region confirming a pseu- 
dogene WGPR79). ^iGPR79 was used to search the HTGS 
database, which retrieved a related GPCR-encoding human 
genomic sequence localized to chromosome 13 (GenBank 
Accession number. ACQ26756). The PCR product contain- 
ing this gene revealed an ORF of 336 aa in length, which we 
named GPR80. A search of the nr database with the 
projected ipGPR79 aa sequence revealed significant idenu- 
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ties in the TM regions with the purinoceptors P2Y 2 (51%), 
P2Y 4 (50%) and P2Y 6 (43%). A similar search with the 
GPR80 sequence revealed greatest identities in the TM 
regions with the purinoceptors P2Y, (45%) and P2Y 4 
(39%) and the cysteinyl leukotrienc receptor CysLT2 
(39%). 



From the HTGS database we retrieved a GPCR-encoding 
sequence contained on a bacterial artificial chromosome 
(BAC) clone localized to chromosome 12q (GenBank 
Accession number AC026331). The PCR amplified product 
(GPR81) encoded a 347 aa protein with identities in the TM 
regions with HM74 (70%), GPR31 (43%) and the purino- 
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Table 1 




oGPCR sequence identities 


Receptor 


ol Identity 


GPR78 


GPR26,49(56) 




SSTR4, 21 (29) 




5HTfi,20(25) 


GPR80 


P2Y b 31 (45) 




P2Y 4 , 29 (39) 




CysLT2, 31 (39) 


GPR81 


HM74,50(70) 




GPR31.30(43) 




P2Y„ 23 (37) 


GPR82 


GPR34, 23 (31) 




GPR17,24(30) 




SSTR2, 23 (30) 


GPR93 


P2Y5.3K40) 




GPR23, 28 (38) 




GPR17,25(36) 


GPR94 


P2Y 12j 46 (57) 




UDP-glucose, 41 (52) 




GPR95, 37 (51) 



Accession number 
AF41U07 

AF4U109 

AF4U110 

AF411111 

AF41U12 

AF411113 



Receptor % Identity 

GPR95 UDP-glucose, 42 (62) 

P2Y|*39(54) 
GPR94, 37 (51) 
GPR101 RE2, <20 (31) 

5HT 1A ,<20(29) 
a lA , <20(29) 
GPR102 FNR,40(42) 
GPR57, 36 (35) 
GPR58, 33 (35) 
GPR103 NPFF2, 31 (38) 

NPY2, 29 (37) 
OalRl, 30 (35) 
i( ( GPR79 P2Yj,37(51) 
P2Y* 37 (50) 
P2Y 6 .33 (43) 



Accession number 
AF11114 

AF411115 

AF411116 

AF411117 

AF41U08 



ceptor P2Y1 (37%). The HM74 sequence was retrieved in 
the same BAG clone as GPR81, indicating a clustering of 
these two genes. 

GPR82 was retrieved using GPR34 (from a search of the 
HTGS database) on a human B AC clone localized to chro- 
mosome 1 (GenBank Accession number. AL161458). The 
GPR82 PGR product encoded a 336 aa protein sharing iden- 
tities in the TM regions with the oGPCRs GPR34 (31 %) and 
GPR17 (30%) and the somatostatin receptor SSTR2 (30%). 

A search of the HTGS database using the cysteinyl leuko- 
triene 2 receptor sequence retrieved a sequence {GPR93) 
encoding a GPCR on a human BAG clone localized to 
chromosome 12 (GenBank Accession number 
AC006087). Hie PCR product obtained encoded a 372 aa 
protein which shared identities in the TM regions with the 
oGPCRs P2Y 5 (40%), GPR23 (38%) and GPR17 (36%). 

A search of the HTGS database retrieved a cluster of three 
genes encoding GPCRs within the same contig localized to 
chromosome 3 (GenBank Accession number AC024886). 
One of these genes was recently reported to encode the 
platelet ADP receptor, P2Y 12 (Zhang et al., 2001; Hollopeter 
et al.. 2001). The P2Y| 2 receptor gene shared homology 
with the other two genes, which we named GPR94 and 
GPR95. A PCR product (GPR94) encoded a 333 aa protein 
which shared identities in the TM regions with the P2Y| 2 
receptor (57%), the UDP-glucose receptor (52%) and the 
receptor encoded by GPR95 (51%). GPR95 encoded a trun- 
cated GPCR, fromTMl to the stop codon, with an intron in 
the ORF evident by the lack of start methionine and the 
presence of an upstream in-frame stop codon. A search of 
the EST database retrieved one EST sequence from human 
testis mRNA encoding GPR95 which also lacked a start 
methionine codon (GenBank Accession number: 



AA758208). This EST was acquired from the LM.A.G.E. 
Consortium, sequenced, and confirmed to share 100% iden- 
tity with the genomic sequence. A more recent database 
search retrieved an EST sequence from a human bladder 
cell line encoding GPR95 with an alternative 5' coding 
region (GenBank Accession number BF028445), confirm- 
ing a complete GPR95 ORF and the presence of an intron 
between the start methionine and TM1 -encoding regions. 
GPR95 encoded a 358 aa protein which shared highest iden- 
tities in the TM regions with the UDP-glucose receptor 
(62%), the P2Yn receptor (54%) and the receptor encoded 
by GPR94 (51%). 

We used the histamine HI and H4 receptor sequences to 
retrieve two GPCR-encoding sequences, GPR101 and 
GPRW2, respectively, from the HTGS database localized 
to chromosome X (GenBank Accession number 
AL390879) and chromosome 6 (GenBank Accession 
number AL357505), respectively. A PCR amplified 
product (GPR101) encoded a 508 aa protein with identities 
in the TM regions with die oGPCR RE2 (31%), the seroto- 
nin 5HT lA receptor (29%) and the a, A adrenergic receptor 
(29%). The GPR102 PCR product encoded a 342 aa protein 
sharing identities in the TM regions with the oGPCRs PNR 
(42%), GPR57 (35%) and GPR58 (35%). 

A search of the EST and HTGS databases with the rat 
leukotriene LT2 receptor sequence retrieved human DNA 
sequences encoding a novel GPCR, GPR103. TTie novel 
receptor sequence was encoded on two overlapping frag- 
ments from a human kidney EST (encoding from TM2 to 
TM7) and from a human HTGS sequence (encoding from 
TM6 to the stop codon). Primers were designed to PCR 
amplify human hypothalamus cDNA and genomic DNA 
which retrieved two fragments encoding TM3 to TM7 and 
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from TM6 to the stop codon, respectively. The TM3 to 
TM7-encoding fragment was used to screen a human 
hypothalamus cDNA library, which retrieved a phage 
DNA fragment encoding GPR103 from the start methiomne 
to TM3 The three DNA fragments revealed 100% identity 
in regions of overlap, confirming these fragments as 
segments of the same gene. To obtain the full-length 
ORF, the three overlapping fragments were joined by 
PCR. and this clone was named GPR103. CPR103 encoded 
a 455 aa protein which shared identities in the TM regions 
with various peptide receptors, including the neuropeptide 
FF 2 (38%), neuropeptide Y2 (37%) and galanin GalRl 
(35%) receptors. 

3.2. Expression analyses 

GPR78 mRNA transcripts were detected in human pitui- 
tary (1.1 kb) and placenta (two signals of 4.2 and l.l kb in 
size) (Fig. 2A.B). However, no GPR78 transcripts were 
observed in human brain or specific CNS regions such as 
the frontal cortex, putamen, thalamus, hypothalamus, amyg- 
dala, hippocampus, pons, medulla and midbrain. In addi- 
tion, human GPR78 transcripts were also absent from 
skeletal muscle, lung, heart, liver, pancreas and kidney. 

To determine the expression distribution of these novel 
oGPCRs, we performed Northern blots in various human 
and rat tissues. <t>GPR79 mRNA transcripts were not 
detected in human brain tissues, including frontal cortex, 



basal forebrain, pituitary, caudate nucleus, nucleus accum- 
bens or hippocampus. In addition. Northern analyses did not 
reveal diGPR79 in rat brain, fetus, liver, spleen or adrenal 
gland tissue. Similarly. GPR80 mRNA transcripts were not 
detected in human brain tissues including the frontal cortex, 
caudate putamen, thalamus, hypothalamus, hippocampus or 

pons tissue. „ , . , 

An mRNA transcript was detected for GPR81 in human 
pituitary tissue, with an absence of signal in frontal 
temporal and occipital lobes of the cortex, basal forebrain, 
caudate nucleus, nucleus accumbens, and hippocampus 
(Fig. 2C). Analyses of GPR82 mRNA expression revealed 
no transcripts in human tissues, including regions of the 
CNS such as the frontal cortex, caudate putamen, thalamus, 
hypothalamus, hippocampus, pons and liver tissue. A search 
of the GenBank database retrieved an EST from cells 
derived from human colonic tissue encoding GPR82 
(GenBank Accession number BF335802). The GPR93 
probe failed to detect mRNA transcripts in human frontal 
cortex basal forebrain, caudate putamen, thalamus, or 
hippocampus. A search of the GenBank database revealed 
an EST encoding GPR93 from human primary tonsil B-cells 
(GenBank Accession numbers: BF975186, BF663176 and 
BF1291 17) and an EST encoding a mouse GPR93 ortholo- 
gue expressed in the small intestine (GenBank Accession 
numbers: AV064817 and AV064680). 

GPR94 mRNA transcripts of 3.2 kb were detected in 
human CNS tissues including the frontal cortex, caudate 
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putamen and thalamus (Fig. 2D). GPR94 was not detected 
in the hippocampus, pons or cerebellum and P2Y I2 was not 
detected in the hippocampus, cerebellum or in peripheral 
liver tissue. GPR95 transcripts were observed in both 
human and rat peripheral tissue. A 1.8 kb signal was 
detected in human prostate, rat stomach and rat fetal tissues 
(Fig. 2E,F). However, GPR95 expression was not observed 
in human thalamus, hypothalamus, hippocampus, pons or 
cerebellum or in rat whole brain tissue. In addition, we 
performed Northern analyses of P2Y l2 transcripts m 
human CNS tissues, which revealed faint signals of 2.7 kb 
in the frontal cortex, caudate putamen, thalamus, hypotha- 
lamus and pons (fig. 2G). GPR101 mRNA transcripts of 9.5 
and 4.2 kb were detected in the caudate putamen and 
hypothalamus, with no expression detected in the frontal 
cortex, thalamus, hippocampus and pons (Fig. 2H). 
GPRW3 transcripts of 4.0, 2.6 and 1.4 kb were detected 
in the thalamus and hypothalamus, with a further 9.5 kb 
signal in the hypothalamus and a 1.4 kb signal in the pitui- 
tary (fig. 21). In addition, GPR103 transcripts were also 
observed in the frontal and occipital cortices, basal fore- 
brain, midbrain and pons (data not shown). 



4. Discussion 

Currently, -350 human GPCRs have been cloned, as 
listed on the GPCRDB (G protein-coupled receptor data- 
base, http://wwwjpcr.Org/7tm/X with -250 representing 
family A (or rhodopsin-like) GPCRs (Lee et al. ( 2001a). 
These receptors total approximately half the predicted 616 
GPCR-encoding sequences contained in the human genome 
(Venter et al, 2001), although the veracity of this total 
number remains to be confirmed. 

The identification of genes encoding the novel GPCRs 
predicts the existence of novel signaling systems leading to 
the discovery of novel ligands, as demonstrated by recent 
reports describing the discovery of apelin (Tatemoto et al., 

1998) , prolactin-releasing peptide (Hinuma et al.. 1998), 
orexin (Sakurai et aL, 1998), melanin-concentrating 
hormone (Bachner et al., 1999; Chambers et aL, 1999; 
Lembo et al., 1999; Saito et aL, 1999; Shimomura et al., 

1999) and urotensin II (Ames et al., 1999; Liu et aL, 1999; 
Mori et aL, 1999; Nothacker et al., 1999) receptor-ligand 
systems. As these GPCR genes (many of which were cloned 
in our laboratory) were used in methods that led to die discov- 
ery and identification of these ligands (Lee et al., 2001a), we 
are continuing to isolate and characterize these novel genes. 

We now report the discovery of ten novel oGPCRs and a 
pseudogene. GPR78 is a paralogue of GPR26, apparent 
from shared identities (56% in the TM regions), a lack of 
asparagine-linked extracellular glycosylation sites, a short 
amino terminus, and similar gene structure (Fig. 1). GPR78 
and GPR26 encoded receptors with shared conserved catio- 
nic arginine and lysine residues in TM6 and TM7, respec- 
tively, two residues recognized to play a role in purinergic 



binding and found only in P2Y receptors (Erb et al., 1995; 
Jiang et al., 1997) (Fig. 1). However, we reported calcium 
mobilization assays of human astrocytoma 1321N1 cells 
and Xenopus laevis oocytes transfected with GPR26 were 
not responsive to nucleoside di- and tri-phosphates (Lee et 
al 2000) We reported high levels of GPR26 expression m 
rn^ny brain regions (Lee et al., 2000), while GPR78 was 
detected only in the pituitary and placenta. However, the 
overall structural homology suggested that GPR78 and 
GPR26 may encode receptors that share a common endo- 
genous ligand. , 
GPR80, GPR81, GPR82, GPR93 and &GPR79 all shared 
identities to P2Y GPCRs orP2Y-like oGPCRs. Previously, at 
least three different nucleotide receptor phenotypes have 
been observed in mammalian tissue, including GPCRs acti- 
vated by adenine nucleotides (e.g. P2Y| and P2Y n ), ™**e 
nucleotides (e.g. P2Ya) and by both adenine and uridine (e.g. 
P2Y 2 and P2Y 4 ) (King et al., 1998). $GPR79 shared closest 
identity with P2 Y 2 and P2 Y 4l even though it does not encode a 
functional GPCR. GPR80 was observed to share highest 
identity with P2Y t , while GPR8U GPR82 and GPR93 shared 
identities with the P2Y-like oGPCR genes HM74, GPR34 
and P2Y 5 , respectively. Some of these oGPCR genes encode 
aa residues conserved amongst the P2Y receptors and shown 
to be involved in purinergic ligand binding (Erb et al., 1995; 
Hoffmann et al., 1999; Jiang et al., 1997) (Fig. 1). While 
expression was not observed for GPR80, GPR82 and 
GPR93 in various CNS regions, GPR81 was observed to 
have an mRNA transcript in the pituitary, suggesting a role 
in neuroendocrine regulation. 

The identities of GPR94 and GPR95 with the genes 
encoding the UDP-glucose and P2Y l2 receptors (>50% in 
the TM regions) indicate a novel subfamily of purinergic- 
like receptors. Previously, the UDP-glucose receptor was 
reported to have a distant sequence homology with the 
P2Y receptors, with an observed widespread tissue distribu- 
tion in human brain and such peripheral tissues as placenta, 
adipose tissue, spleen, intestine, stomach, skeletal muscle, 
lung and heart (Chambers et aL , 2000). Hie identification of 
the platelet ADP (P2Y«) receptor resulted from cDNA 
isolated from rat platelets and human hypothalamus 
(Zhang et al„ 2001; Hollopeter et al., 2001). An alignment 
of these receptors with the GPCRs encoded by GPR94 and 
GPR95 revealed several residues conserved in P2Y purino- 
ceptors (see above) also conserved throughout this novel 
purinoceptor-like subfamily (Fig. 1). Northern analysis of 
GPR94 mRNA revealed expression in various regions of the 
brain, suggesting a neuromodulatory role. In contrast, 
GPR95 mRNA was detected in peripheral tissue (i.e. 
human prostate and rat stomach). 

The receptor encoded by GPR101 appeared to be a distant 
relative of the biogenic amine superf amily of GPCRs, with 
TM identities of -30% with the adrenergic and serotonin 
receptors, as well as the muscarinic and dopamine receptors 
(data not shown). GPR101 mRNA transcripts were observed 
in brain tissue, suggesting the presence of an endogenous 



FEB 02 2004 09:00 FR CISTI ICIST 



613 952 9303 TO 12158648037 P. 10 



D.K. Ue et al. I Gene 275 (2001) 83-91 



90 

amine neurotransmitter ligand. peihaps novel k M 
The receptor encoded by GPR102 shared significant TM 
^nSUth an amine binding receptor-like GPCR f aim* 
(including PNR, GPR57 and GPR58) suggesting they may 
ako share a common endogenous ligand. The recepto 
encoded by GPR103 shared highest identities «* 
neuropeptide receptors. The significant levels of GPRW3 
expression in the brain, particularly in the thalamus and 
hypothalamus, suggest an endogenous peptide hgand, 
perhaps involved in physiological functions such as pain 
modulation and neuroendocrine regulation. 

Interestingly, some of these novel oGPCR genes appear 
to be clustered on various human chromosomes. The Of k« 
gene was localized to chromosome 12q, proximal to the 
closely related oGPCR gene, HM74. The GPR82 gene 
was retrieved together with the GPR34 gene in a human 
BAC clone localized on chromosome 1. Another search ot 
the HTGS database revealed GPR94, GPR95 and P2Y„ 
localized within the same BAC clone on chromosome 3. 
This cluster of genes also includes the UDP-glucose recep- 
tor gene, which together with the more distant P2Y, receptor 
gene further localizes this cluster to chromosome 3q24-25 
(interval D3S1279-1280) (Hollopeter et al. 2001). We have 
previously reported clusters of homologous GPCRs genes, 
including the GPR40 through GPR43 gene cluster (Sawz- 
dargo et al., 1997) as well as the 5-fflV-like VwiogtM, 
tliGPR57, GPR58 and PNR gene cluster (Lee et al., 2000). 
Given the significant sequence similarity of GPR102 with 
PNR, GPR57 and GPR58 and its localization on chromo- 
some 6, GPR102 may be another paralogue member of this 
gene cluster. 

In conclusion, we have identified ten novel GPCR genes 
and a pseudogene. Transcripts for GPR78. GPR81, GPR94, 
GPR95, GPR10J, GPR103 and P2Y n were detected in 
various CNS and peripheral tissues. Given the high levels 
of identity observed within paralogous oGPCR gene clus- 
ters, future efforts will likely discover common endogenous 
tigands for each of these novel GPCR subfamilies. The 
increasing number of oGPCRs that continue to be isolated 
with unique distribution profiles in brain and periphery is 
indicative that the search for novel transmitter ligands 
should be intensified. These efforts have the tremendous 
potential to uncover novel physiological roles for these as 
yet unknown receptor-transmitter signaling systems. 
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gpcr collection Successful Search 



Introduction 

Product Description 

Product Application 

Search and Order 
Human Genes 

Terms and Conditions 

Disclaimer 

Product Manual 

Vector Sequences 

Legal Notices 



Accn 


Clone 


Description 


Cost 


NM_054021 


MG1027_F10 
1 


Homo sapiens G protein-coupled 
receptor 101 (GPR101), mRNA 


$80< 



This price is valid only in the USA and Canada 
Add to' Shopping Cart 



1 The 5' and 3' sequences of our molecular clones have been matchec 
to an accession number as a point of reference. Note that the completi 
sequence of our molecular clones may differ from the sequence 
published for this corresponding accession number, e.g., by 
representing an alternative RNA splicing form. 

>MG1027_F10 OriGene 5' read for NM_054021 

GCTTCTGGCTGTTGCCATGACGTCCACCTGCACCAACAGCACGCGCGAGAGTAACAGCAC 
CCACACGTGCATGCCCCTCTCCAAAATGCCCATCAGCCTGGCCCACGGCATCATCCGCTC 
AACCGTGCTGGTTATCTTCCTCGCCGCCTCTTTCGTCGGCAACATAGTGCTGGCGCTAG r . 
GTTGCAGCGCAAGCCGCAGCTGCTGCAGGTGACCAACCGTTTTATCTTTAACCTCCTCG r 
CACCGACCTGCTGCAGATTTCGCTCGTGGCCCCCTGGGTGGTGGCCACCTCTGTGCCTC: 
CTTCTGGCCCCTCAACAGCCACTTCTGCACGGCCCTGGTTAGCCTCACCCACCTGTTCGC 
CTTCGCCAGCGTCAACACCATTGTCTTGGTGTCAGTGGATCGCTACTTGTCCATCATCCi 
CCCTCTCTCCTACCCGTCCAAGATGACCCAGCGCCGCGGTTACCTGCTCCTCTATGGCA( 

ctggattgtggccatcctgcagagcactcctccactctacggctggggccaggctgcct: 

TGATGAGCGCAATGCTCTCTGCTCCATGATCTGGGGGGCCAGCCCCAGCTACACTATTC: 
CAGCGTGGTGTCCTTCATCGTCATTCCACTGATTGTCATGATTGCCTGCTACTCCGTGG'. 
GTTCTGTGCAGCCCGGAGCAGCATGCTCTGCTGTA 



View the IMCBI page for this accession number (in new window) 
List InterPro domains that OriGene associates with this accession 
number 



Search for another accession number 
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Ultimate™ ORF Browser: 

Advanced Search for Ultimate™ ORF Clones 



Search By Sequence I Drowse By Gene Ontology 



33 total records for G -Protein Coupled Receptors 



Buy Clone ID Species Definition 



Gene 
Symbol 
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IOH3294 


Human 


complement component 5 receptor 1 (CSa Ugand); complement component- 
5 receptor-2 (CSa llgand) 


C5R1 


r 


IQHJL2fili 


Human 


purtnerglc receptor P2Y, G -protein coupled, 11 


P2RY11 


r 




Human 


done HGC: 33224 IMAGE: 5267661, mRNA, complete cds. 


RDC1 


r 


IQHJL1Q32 


Human 


Simitar to putative nuclear protein ORF1-FL49 


ORF1-FL49 


r 


mxxm 


Human 


gtycoproteln lb (platelet), alpha polypeptide 


GP1BA 


r 


IQH1987 


Human 


tachykinin receptor 1 tsofbrm short; NK-1 receptor; Tachykinin receptor 1 
(substance P receptor; neurofclnln-i receptor); tachykinin 1 receptor 
(substance P receptor, neurokinin 1 receptor); neurokinin 1 receptor 


TACR1 




tQMllOSfi 


Human 


Shnttar to POSSIBLE GUSTATORY RECEPTOR CLONE PTE01 


toaisu: 


c 


IQH9916 


Human 


coagulation factor n (thrombin) receptor-ike 1 


F2RU 


r 


IOH9624 


Human 


vasoactive Intestinal peptide receptor 2 


VTPR2 


r 


BttilfifiZi 


Human 


endotheun receptor type A 


EONRA 


r 


1QH23637 


Human 


Stouter to parathyroid hormone receptor 1, done MGC:34S62 
IMAGE:518088S, mRNA, complete cds. 


PTHR1 


r 


tOHissea 


Human 


Ouffy blood group 


FY 


r 


1QH458S 


Human 


cnoiecy/swunin d iw*ywf 


CCKDR 



r U(MMA endothelial tfrterenttatton, lysophosphatldlc add G-proteln-coupled receptor, 

lOHiiott Human 4; Gprctem-coupled receptor, tf A re^^ EDG4 
receptor EDG4 

j— IQH10866 Human ^^J^^^^^ precursor; leukocyte antigen CQ97; seven-span Q97 

r,^™,, Human Ibrmyl peptide receptor-ike 1; ipoxln A4 receptor (formyt peptide receptor 

™22u2 related) 

r" tOH77M9 Mwmtfl eo^enomedulto receptor ADMR 

r tOHi3239 Mum * n super conserved receptor expressed In brain J SRE63 
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Customer Service: 1800 7861236 

Technical Support: 318 219 1123 
Fax: 318 798 1849 

lnfo(5)fabgcnnix.com 
www.fabgcnnix.com 



Nnw.1 Omhan retinal r-nmldn couplf H R«™tnr (OPCR-7^ selective antibodies 

Anli.r,PC:R-75A-i^i« (r,PCR7 <-«inp r,prR75-l(ll HP and OPCR7S-* I2AP) 

^SS ; irorr ^'ibo^^ retinal dystrophy (DHRD), an inherited retinal degenerate d.sease tha, maps to chromosome 

2 P 16 ( 1 )- ,i to vr%n (^ac\ ominn nriHO nrotein that is primarily expressed in human retinal pigment epithelium 

^"SpCR.^i™,^ 



GPCR75-101AP 



GPCR75-112AP 



PC-GPCR75 



P-GPCR75 



Host Species 



Rabbit 



Rabbit 



Rabbit 



Nature 



Polyclonal antisera 



Affinity purified IgG 



Affinity purified IgG 



N/A WB positive control 



Cross reactivity 



R.M.H 



R.M.H 



R.M.H 



Rat 



n/a 



Quantity 



100 ug 



100 ug 



For 5 App 



250 ug 



150 ul 



60 ul 



inquire 



3 



$ 225.00 



$ 75.00 



$ 65.00 



Immunogen: Synthetic cyclic peptide (GPCR75-101AP - PNATS LHVPHSQEGNSTS-emide; GPCR75-112AP 

STSUJBOLQDUHTATLVT&wnidc). . 
Concentration: GPCR7S-101AP; GPCR-1 12AP 1«0 <«««■- 0.7M * mgM in 50% b fw 

Applications: A^OKWtww^ 

Reactivity: TWs«n1ibodyae«eeUastafle78U^^ 78kD.OP.75 

PtOtOOOls: Standard protoed for r^applk^ 

pS^~- FabOccntx tot Inc. strongly weo««eaditortaig«toi»to 

optimfa* condition, ^^J^^^^^ 0 ^ ^ orth^rosalAnerthMate as 

Form/Storage: 

properties. For ton^teim ttoragej of «*£odies. ^^^■^^J^^^^ %t moif solutions 
ISeTS W^rSfb7«tered tr^h 0>«* fitter after eve* use for lor^erm storage, 
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78 kDa Orphan Receptor-75 
in human RPE cells. 
Antibody GPCR-100P 
(1:400) 



Tarttclin E. E~, Kfischner u a, Beumpwu, o«u. *. -/ ~T ,^,80 000 

C Gregory-Evans C Y. Btochem. Blophys. Res. Co^ua 2«U7^80. |999_ 
2 FaroSqui. I!m, Brock. W. J. A. Hamdi, Prasad. C. (1991) I. Neurochcm. 57. 
• For ««rs who n-y retire Urge amounts of OPCR75-1I00P or GP«^0 1 AP pica* ^' b - tbulk ™ te ™' ^^.^oSF.OO.Z^v.O.OO 
This Product it for Research Use Only «nd is NOT intended for asc in humans or cimical diagnosis. 
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Rat Taste Receptor 2 (TR2) Antibodies 

Rai Taste Receptor 2 (TR2) Antibodies 



Cat. U TR21-P, Rat TR2 Control Peptide # 1, SIZE: 100 ug/100 ul 
FORM: (E Soln (E Lyophilized Lot # 3 1 1 3P 

Cat # TR21-S, Rabbit Anti-rat TR2 antiserum # 1, SIZE: 100 ul neat antiserum 
FORM: <E Soln GE Lyophilized. Lot # 38889S 

Cat. # TR21-A, Rabbit Anti-rat TR2 Ab # 1 (affinity pure) SIZE: 100 ug 
FORM: (E Soln (E Lyophilized. Lot # 38889A 



Higher vertebrates are believed to possess at least five basic tastes: Sweet, bitter, sour, salty, and unami 
(the taste of monosodium glutamate). Taste receptor cells that may selectively reside in various parts of 
the tongue and respond to different tastants and perceive these taste modalities. Circumvallate papillae, 
found at the very back of the tongue, are particularly sensitive to biter substances. Foliate papillae, found 
at the posterior lateral edge of the tongue, are sensitive to sour and bitter. Fungiform papillae at the front 
of the tongue specialize in sweet taste. 

Recently, two novel taste receptors, TR1 and TR2, have been cloned with distinct topographical 
distribution in taste receptor cells and taste buds. TRs are members of a new group of 7 TM domain 
containing GPCR distantly related to other chemosensoiy receptors (Ca+-sensing receptor (CaSR, a 
family of putative hormone receptor (V2R), and metabotropic glutamate receptors). TR1 is expressed in 
all fungiform taste buds, whereas TR2 localized to the circumvallate taste buds. Both receptors do not 
co-localize with gustducin. 

Source of Antigen and Antibodies 

TR1 (rat 840 aa) and TR2 (tat 843 aa) share -40% homology with each other, and -30% with CaSR, 
and 22-30% with V2R pheromone receptors and mGLURs. Rat TR are 7 TM domain containing protein 
with an extra long N-terminal, extracellular domain (1). A 19 AA Peptide (designated TR21-P; control 
peptide) sequence near the C-terminus of rat TR2(1) was selected for antibody production. The peptide 
was coupled to KLH, and antibodies generated in rabbits. Antibody has been affinity purified using 
control peptide-Scpbarose. 

Form & Storage 

Control peptide Solution is provided in PBS, pH 7.4 at 1 mg/ml (100 ug/100 ul). Antiserum is supplied 
as neat serum (100 ul soln or lyophilized). Affinity pure antibodies were purified over the peptide- 
Sepharose column and supplied as 1 mg/ml soln in PBS, pH 7.4 and 0.1% BSA as stabilizer (100 ul in 
solution or Lyophilized). 

The peptides and antibodies also contain 0.1% sodium azide as preservative. Lyophilized products 
should be reconstituted in 100 ul water and gently mixed for 1 S min at room temp. All peptide/antibody 
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received in solution or 

reconstituted from lyophilized vials should be stored frozen at -20oC or below in suitable aliquots. It is 
not recommended to store diluted solutions. Avoid repeated freeze and thaw. 

Recommended Usage 

Western Blotting (1:1 K-5K for neat serum and 1-10 ug/ml for affinity pure antibody using ECL 
technique). 

ELISA: Control peptide can be used to coat ELIS A plates at 1 ug/ml and detected with antibodies (1:10- 
50K for neat serum and 0.5-1 ug/ml for affinity pure). 

Histochemistry & Immunofluorescence: We recommend the use of affinity purified antibody at 1-20 
ug/ml in paraformaldehyde fixed sections of tissues (1). 

Specificity & Cross-reactivity 

The 19 AA rat TR21-P control peptide is specific for rat TR2 It has no significant sequence homology 
wUh TRlo guaducin or pheromone receptors. Antibody cross-reactiy ity in various species has not 
teen s£died. The TR21-P control peptide is available to confirm specificity of antibodies. 

References: 

1 . Hoon MA et al (1999) Cell 96, 541-555; Lindemann B (1999) Nature Med. 5, 381-382 

"Moat Antisera" are the unpurified antiserum and it is suitable for ELISA and Western 
"Affinfty pur? an^es ha U ve been over the antigen-affinity oo.umn and recommended for 

tSSKS^ for Western as they are very short peptides. They are 
Intended for ELISA or antibody competition studies. 
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